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Methods for Measuring Physical Characteristics of Nucleic Acids by Microscopic 

imaging 

5 This invention was made with U.S. Government support 

under Contract No. HG 00225 awarded by the National 
Institutes of Health of the United States Department of 
Health and Human Services and the U.S. Government has certain 
rights in the invention. 

10 

1. FIELD OF THE INVENTION 

This invention relates to methods and compositions for 
manipulating and characterizing individual polymer molecules, 
especially nucleic acid molecules, according to, for example, 
^5 size and/or nucleotide sequence. 

2- BACKGROTIND OF THg T NVENTIQM 

The analysis of nucleic acid molecules at the genome 
level is an extremely complex endeavor which requires 

2Q accurate, rapid characterization of large numbers of often 
very large nucleic acid molecules via high throughput DNA 
mapping and sequencing. The construction of physical maps, 
and ultimately of nucleotide sequences, for eukaryotic 
chromosomes currently remains laborious and difficult. This 

25 is due, in part, to the fact that current procedures for 
mapping and sequencing DNA were originally designed to 
analyze nucleic acid at the gene, rather than at the genome, 
level (Chumakov, I. et al . , 1992, Nature 1^:380; Maier, E. 
et al., 1992, Nat. Genet. 1:273). 

3Q Traditionally, the separation and molecular weight 

distribution of nucleic acid molecules has been accomplished, 
most commonly, via gel electrophoresis (see, for example, 
Freifelder, 1976, Physical Biochemistry, W.H. Freeman), which 
involves moving a population of molecules through an 

35 appropriate medium, such that the molecules are separated 
according to size. Such electrophoretic methods offer an 
acceptable level of size resolution, but, especially for 
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purposes of high throughput mapping, suffer from a number of 
setbacks. 

For example, such techniques require the preparation of 
DNA in bulk amounts. With respect to genome mapping, such 
5 preparative procedures may require sources such as genomic 
DNA or DNA from yeast artificial chromosomes (YACs; Burke, 
D.T. et al., 1987, Science 216:806; Barlow, et al., 1987, 
Trends in Genetics 3:167-177; Campbell et al . , 1991, Proc. 
Natl. Acad. Sci. USA 88:5744). Obtaining quantities of DNA 

10 from these sources which are sufficient for detailed 

analyses, such as restriction mapping, is time consuming and 
often impractical. Further, because populations of moleculef 
of like size migrate through the medium at the same rate, it 
is impossible to separate individual molecules from within a 

15 sample of particles by utilizing such a technique. 

Additionally, while it is possible to resolve a wide size 
range of DNA molecule populations gel electrophoresis 
techniques, optimal techniques can often require the use of 
several different gel matrix compositions and/or alternative 

20 electrophoresis procedures, depending upon the sizes of the 
molecules of interest. For example, the separation of large 
molecules of DNA may require such techniques as pulse field 
electrophoresis (see, e.g. . U.S. Patent No. 4,473,452). 
Further, standard gel electrophoresis techniques involve the 

25 separation of populations of molecules according to size, 
making it impossible to separate individual molecules within 
a polydisperse mixture. In summary, therefore, the accurate, 
rapid, practical, high throughput separation of individual 
DNA molecules, especially those of highly disparate sizes, 

30 which would often be required for genomic mapping purposes, 
is impossible via gel electrophoresis. 

Techniques have been reported for the visualization of 
single nucleic acid molecules and complexes. Such techniques 
include such fluorescence microscopy-based techniques as 

35 fluorescence in situ hybridization (FISH; Manuelidis, L. et 
al., 1982, J. Cell. Biol. 95:619; Lawrence, C.A. et al . , 
1988, Cell 52:51; Lichter, P. et al . , 1990, Science 247:64; 

- 2 - 
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Heng, H.H.Q. et al . , 1992, Proc. Natl. Acad. Sci. USA 
89:9509; van den Engh, G. et al . , 1992, Science 2^:1410) and 
those reported by, for example, Yanagida (Yanagida, M. et 
al., 1983, Cold Spring Harbor Symp. Quantit. Biol. 4J:177; 
5 Matsumoto, S. et al . , 1981, J. Mol . Biol. 112:501-516); 
tethering techniques, whereby one or both ends of a nucleic 
acid molecule are anchored to a surface (U.S. Patent No. 
5,079,169; U.S. Patent No. 5,380,833; Perkins, T.T. et al . , 
1994, Science 2^4:819; Bensimon, A. et al . , 1994, Science 
10 2il:2096); and scanning probe microscopy-based visualization 
techniques, including scanning tunneling microscopy and 
atomic force microscopy techniques (see, e.g. . Karrasch, S. 
et al., 1993, Biophysical J. £5:2437-2446; Hansma, H.G. et 
al., 1993, Nucleic Acids Research 21:505-512; Bustamante, C. 
15 et al., 1992, Biochemistry 11:22-26; Lyubchenko, Y.L. et al . , 
1992, J. Biomol. Struct, and Dyn. 10:589-606; Allison, D.P. 
et al., 1992, Proc. Natl. Acad. Sci. USA 8^: 10129-10133 ; 
Zenhausern, F. et al . , 1992, J. Struct. Biol. ifl8:69-73) . 
While single molecule techniques offer the potential 
20 advantage of an ordering capability which gel electrophoresis 
lacks, none of the current single molecule techniques can be 
used, on a practical level, as, for example, high resolution 
genomic mapping tools. The molecules described by Yanagida 
(Yanagida, M. et al . , 1983, Cold Spring Harbor Symp. Quantit. 
25 Biol. 47:177; Matsumoto, S. et al . , 1981, J. Mol. Biol. 

132:501-516), for example, were visualized, primarily free in 
solution, in a manner which would make any practical mapping 
impossible. Further, while the FISH technique offers the 
advantage of using only a limited number of immobilized 
30 fragments, usually chromosomes, it is not possible to achieve 
the sizing resolution available with gel electrophoresis. 

Single molecule tethering techniques, as listed above, 
generally involve individual nucleic acid molecules which 
have, first, been immobilized onto a surface via one or both 
35 of their ends, and, second, have been manipulated such that 
the molecules are stretched out. These techniques, however, 
are not suited to genome analysis. First, the steps involved 
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are time consuming and can only be accomplished with a small 
number of molecules per procedure. Further, in general, the 
tethered molecules cannot be stored and used again. 
A combination of the sizing capability of gel 
5 electrophoresis and the ordering capability of certain single 
molecule techniques such as, for example, FISH, would, 
therefore, be extremely useful for genomic analyses such as 
genomic mapping. Such analyses would be further aided by the 
ability to manipulate the single molecules being analyzed 
10 Additionally, an ability to reuse the nucleic acid samples of 
interest would increase the efficiency and throughput 
capability of the analysis. Currently, however, there exists 
no single technology which embodies, in a practical manner, 
each of these elements. 
15 Citation of documents herein is not intended as an 

admission that any of the documents cited herein is pertinent 
prior art, or an admission that the cited documents are 
considered material to the patentability of the claims of the 
present application. All statements as to the date or 
20 representations as to the contents of these documents are 
based on the information available to the applicant and do 
not constitute any admission as to the correctness of the 
dates or contents of these documents. 

3. SUMMARY O F THE TMVie^y ftftf 
The present invention relates to methods and 
compositions for characterizing and manipulating individual 
nucleic acid molecules, including mammalian chromosome-sized 
individual nucleic acid molecules. The methods and 

30 compositions described herein can be used for the accurate, 
rapid, high throughput analysis of nucleic acid molecules at 
the genome level. This analysis may, for example, include the 
construction of high resolution physical maps, referred to 
herein as "optical mapping, •• and the detection of specific 

35 nucleotide sequences within a genome, referred to herein as 
"optical sequencing." 
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Specifically, methods are described by which single 
nucleic acid molecules, including mammalian chromosome -sized 
DNA molecules, are elongated and fixed in a rapid, controlled 
and reproducible manner that allows for the nucleic acid 
5 molecules to retain their biological function and, further, 
makes rapid analysis of the molecules possible. In one 
embodiment of such a procedure, the molecules are elongated 
in a flow of a molten or unpolymerized gel composition. The 
elongated molecules become fixed as the gel composition 
10 becomes hardened or polymerized. In such an embodiment, the 
gel composition is preferably an agarose gel composition. 
The elongated molecules became fixed as the agarose hardens. 

In a second embodiment, the single nucleic acid 
molecules are elongated and fixed in a controllable manner 
15 directly onto a solid, planar surface. This solid, planar 
surface contains a positive charge density that has been 
controllably modified such that the single nucleic acid 
molecules will exhibit an optimal balance between the 
critical parameters of nucleic acid elongation state, degree 
20 of relaxation stability and biological activity. Further, 
methods, compositions and assays are described by which such 
an optimal balance can precisely and reproducibly be 
achieved. 

In a third embodiment, the single nucleic acid molecules 
25 are elongated via flow-based techniques. In such an 

embodiment, a single nucleic acid molecule is elongated, 
manipulated (via, for example, a regio-specif ic restriction 
digestion) , and/or analyzed in a laminar flow elongation 
device. The present invention further relates to and 
30 describes such a laminar flow elongation device. 

The elongated, individual nucleic acid molecules can 
then be used in a variety of ways which have applications for 
the analysis of nucleic acid at the genome level . For 
example, such nucleic acid molecules may be used to generate 
35 ordered, high resolution single nucleic acid molecule 
restriction maps. This method is referred to herein as 
"optical mapping" or "optical restriction mapping." 
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Additionally, methods are presented by which specific 
nucleotide sequences present within the elongated nucleic 
acid molecules can be identified. Such methods are referred 
to herein as "optical sequencing." The optical mapping and 
5 optical sequencing techniques can be used independently or in 
combination on the same individual nucleic acid molecules. 

Still further, the elongated nucleic acid molecules of 
the invention can be manipulated using any .standard 
procedure. For example, the single nucleic acid molecules 
10 may be manipulated by any enzymes which act upon nucleic acid 
molecules, and which may include, but are not limited to, 
restriction endonucl eases, exonucleases, polymerases, ligases 
or helicases. 

Additionally, methods are also presented for the imaging 

15 and sizing of the elongated single nucleic acid molecules. 
These imaging techniques may, for example, include the use of 
fluorochromes, microscopy and/or image processing computer 
software and hardware. Such sizing methods include both 
static and dynamic measuring techniques. 

20 Still further, high throughput methods for utilizing 

such single nucleic acid molecules in genome analysis are 
presented. In one embodiment of such high throughput 
methods, rapid optical mapping approaches are described for 
the creation of high-resolution restriction maps. In such an 

25 embodiment, single nucleic acid molecules are elongated, 
fixed and gridded to high density onto a solid surface. 
These molecules can then be digested with appropriate 
restriction enzymes for the map construction. In an 
alternative embodiment, the single nucleic acid molecules can 

30 be elongated, fixed and gridded at high density onto a solid 
surface and utilized in a variety of optical sequencing- based 
diagnostic methods. Notably, such diagnostic grids can also 
be reused. Further, the high throughput and methods can be 
used to generate rapidly information derived from procedures 

35 that combine optical mapping and optical sequencing methods. 
The present invention provides techniques, including 
high throughput techniques, that reproducibly and rapidly 

- 6 - 
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generate populations of individual, elongated nucleic acid 
molecules that not only retain biological function but are 
accessible to manipulation and make possible rapid genome 
analysis . 

5 

4. BRIEF DESCRIPTION OF THE PIGURKS 

FIG. 1. Schematic drawing of an electrophoretic 
microscopy chamber which is specifically adapted to 
10 fluorescence microscopy studies. 

FIG. 2. Partly schematic and partly block diagram 
showing an interconnection of exemplary chamber electrodes in 
an electrophoresis chamber which may be used in the present 
15 invention. 

FIG. 3A-3B. Schematic illustration of the instrumen- 
tation used in the microscopic study of DNA molecules in a 
medium according to this invention, and a more detailed 
20 diagram showing the instrumentation for measuring 
birefringence . 

FIG. 4A-4I. Depicted herein are the DNA molecular 
conformational and positional changes when G bacteriophage 
25 molecules are subject to two sequential electric fields in 
different directions . 

FIG. 5A-5J. Depicted herein are the DNA molecular 
conformational and positional changes during relaxation of G 
30 bacteriophage DNA molecules after electrophoresis for 600 
seconds, as revealed by the fluorescence microscopy 
experiments described in Example 4 . 

FIG. 6. Optical mapping. DNA molecules and restriction 
35 enzyme are dissolved in molten agarose without magnesium 

ions. The DNA molecules are elongated by the flow generated 
when the mixture is sandwiched between a slide and coverslip. 

- 7 - 
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Stretched molecules are fixed in place by agarose gelation 
Magnesium ion diffusion into the gel triggers digestion and 
cleavage sites appear as growing gaps as the molecular 
fragments relax. 

5 

FIG. 7A-7D. Histograms of optical mapping. Not 1 cut 
frequencies, showing variation with molecule size and number 
of cut sites, are indicated. Cutting frequencies were scored 
by counting the number of Not 1 cuts in nucleic acid 

10 molecules present in microscope fields. Such fields 

typically contain approximately 3-5 molecules. Because 
approximately half the fields showed no Not 1 cutting and 
were, therefore, not scored, this underestimates the number 
of uncut molecules . The expected .number of cut sites and 

15 chromosome sizes: 7A: Ch. 1(240 kb; l; 7B. : Ch. V and 

VIII (595 kb) 3 and 2; 7C: Ch. XI (675 kb) 2; and 7D: Ch. XIII 
and XVI (950 and 975 kb) 1. Chromosome pairs V and VIli, and 
XIII and XVI were present on the same mount. 

20 FIG. 8A-8H. Depicted are some restriction fragment 

relaxation modes for a singly cleaved, gel-fixed, elongated 
molecule. Horizontal arrows indicate direction of 
relaxation. Relaxation modes illustrated: 8A depicts a 
fixed molecule before cleavage, 8B-8E depict possible 

25 relaxation modes producing detectably cleaved molecules, and 
8F-8H depict relaxation modes producing undetectably cleaved 
molecules . 



FIG. 9. Schematic representation depicting possible 
30 relaxation events to form pools of segments or "balls" at 
coil ends. Agarose gel is illustrated as a series of pegs 
with free spaces available for molecules. Gel pegs might 
intersect the embedded DNA molecule during gelation and 
possibly entrap it. The coil segments positioned in the pool 
3 5 region comprise a relaxed sub-coil region and have higher 
entropy than the coil stretched out between them. These 
pools may act as molecular rivets in some circumstances. 
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particularly if the segment pool mass approaches that of the 
intervening coil. 

FIG. lOA-lOB. Optical mapping sizing results for Not I 
5 endonuclease restriction fragments from S. cerevisiae 
chromosomes I, V, VIII, XI, XIII, and XVI calculated as 
described, plotted against published results. The diagonal 
line is for reference. Typical fragment images are shown in 
this figure. (See example 13). The inset shows the estimate 
10 of population standard deviation (kb) . Error bars represent 
90% confidence on means (main graph) or standard deviation 
(inset) . lOA: the relative intensity determination of 
fragment sizes. lOB: the relative apparent length 
determination of fragment sizes. 

15 

FIG. IIA-IIC. Scatter plot of normalized absolute 
intensity vs. apparent length. Absolute intensities from six 
individual images were calculated and plotted against 
apparent length over a time interval typically used in 

20 optical mapping (10-15 minutes) . For each sample, the 

initial intensity was found by averaging absolute intensity 
values from groups of 5 adjacent images and taking the 
largest value. The values from several samples were 
normalized by dividing values from each image by the initial 

25 intensity for the sample. IIA: chromosome I 120kb Not I 
fragment, 7 samples. IIB: chromosome XI 285kb Not I 
fragment, 4 samples. IIC: chromosome XI 3 60kb Not I 
fragment , 4 samples . 

30 FIG. 12. Comparison of Not I endonuclease restriction 

maps of optical mapping results of S. cerevisiae chromosomal 
DNA molecules with published restriction maps. Maps were 
constructed from length (Len) , intensity (Int) or a 
combination of both (Com) . Bar lengths for the optical 

35 mapping data are proportional to the means plotted in FIG. 
lOA-lOB, and typical images are shown in FIG. 13A-13F. 
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FIG. 13A-13F. Typical fluorescence microscopy images of 
S. cerevisiae chromosomal DNA molecules stained with DAPI and 
embedded in agarose gel during Not I restriction endonuclease 
cleavage. Chromosomal DNA molecules were prepared and fixed 
5 as described in Example 13 and cited references. Images were 
background corrected using a smoothed and attenuated 
background image, smoothed, and stretched, using 16 -bit 
precision. Images show Not I restriction digestion 
evolution, with arrows highlighting cut sites. Intervals are 

10 timed after addition of Mg^*. 13A: Ch. I {240kb) , 20 and 60 
sec; 13B: Ch. XI {675kb) , 500, 880 and 1160 sec; 13C: Ch. V 
(595kb), 200, 240, 520 sec; 13D: Ch. VIII (595kb) , 440, 1220 
and 1360 sec; 13E: Ch. XIII (950kb) , 100 and 560 sec; 13F: 
Ch. XVI (975kb), 460 and 560 sec. Bars, 5 ^m. A lOOx 

15 objective was used to image results in panels (13A-13D) and a 
63x objective was used for panels (13E and 13F) . 

FIG. 14. Optical mapping results from Rsr II and Asc I 
endonuclease restriction digest of 5. cerevisiae chromosomes 
20 III and XI. Maps were constructed from fully cut length 

(Len) or intensity (Int) data, and refined using partial cut 
length. Bar lengths are proportional to the calculated 
means, and typical images are shown in FIG. 15. Number of 
cuts was determined as in FIG. 7. 

25 

FIG. 15A-15C. Fluorescence microscopy images of 5. 
cerevisiae chromosomal DNA molecules stained with DAPI and 
embedded in agarose gel during Rsr II or Asc I restriction 
endonuclease cleavage. Chromosomal DNA molecules were 

30 digested and analyzed as in FIG. 13. Images show restriction 
digestion evolution, with arrows highlighting cut sites. 
ISA: Ch. Ill, Rsr II, 1100 and 1820 sec; 15B: Ch. XI, Rsr II, 
20, 600, 920, 1060 sec; 15C: Ch . XI, Asc I, 1160, 1500, 1780, 
1940 sec. An isoschizomer to Rsr II, Csp I, was also used 

35 and gave identical results. Bar, 5 ^m. 
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FIG, 16. Glass surface properties as a function of 
polylysine treatment . Glass surfaces were incubated for 16 
hours in different concentrations of poly-D- lysine , 
MWs=350,500. Lambda bacteriophage DNA molecules in EcoRI 
5 restriction buffer and ethidium homodimer, minus magnesium 
ions, were mounted onto the treated glass surfaces. Square 
and circle show ratio of absorbed DNA and average length of 
absorbed DNA, respectively. Each point represents roughly 50 
molecules measured and bars show the standard deviation about 
10 a mean. Sample preparation, imaging techniques and analysis 
are given in Methodology. 

FIG. 17. Gallery of fluorescence microscopy images of 
lambda clones from Optical Mapping results. Clones from a 

15 mouse yeast artificial chromosome (YAC) (Burke et al . , 
Science 236:806-812, 1987; Murray and Szostak, Nature 
305:189-193, 1983) spanning the Pygmy locus were subcloned 
into Lambda FIX II and digested with EcoRI and BamHI . Maps 
for these and other molecules (not shown) were constructed by 

20 Optical Mapping techniques (Methodology) and shown in Fig. 
19. Images show typical molecules used for map construction. 
Bars: 5 microns. Image v is an enlargement of image t and 
image w is at the same scale as image v. The enzymes used 
for map construction are indicated as (E) for EcoRI and (B) 

25 for BamH I. a, uncut lambda DNA; b, B3 (E) ; c, F (B) ; d, B 
(B) ; e, D (B) ; f, E (B) ; g, 914 (E) ; h, B (E) ; i, G (B) ; 
j, C (E) ; k., B4 (E) ; 1, Yll (E) ; m, 618 (E) ; n, 617 (E) ; o, 
305 (E) ; p, A (B) ; q, 1004 (B) ; r, E (E) ; s, B6 (E) ; t, A2 
(E) ; u, C3 (E) ; v, A2 (E) ; w, F (E) . 

30 

FIG. 18. EcoRI and BamH I endonuclease restriction 
fragment sizing results for Lambda FIX II clones, calculated 
as described and plotted against gel electrophoresis data, 
a, Relative fluorescence intensity results. The diagonal 
35 line is for reference. Typical fragment images are shown in 
Fig. 17. Inset: estimate of population standard deviation 
(kb) . Error bars represent 90% confidence on means (main 
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graph) or standard deviation (inset) . The size of the whole 
molecule was determined by gel electrophoresis, b, results 
for small fragments. The best fit line through the origin 
(slope 0.665) was used to calibrate fragment originally 
5 estimated at less than 6 . 5 )cb prior to incorporation into 
maps, c, results after correction, d, Relative apparent 
length sizing results from the same images. The diagonal 
line is for reference. 

10 FIG. 19. EcoRI and BamHI restriction maps constructed 

by Optical Mapping. Clones are labeled on the left side. 
The upper ticks are EcoRI restriction sites and lower ticks 
are BamHI sites. Table 1 shows the fragment sizes. 

15 FIG. 20. Optically sizing insert DNA of lambda FIX II 

clones. Lambda clones mounted on the surface were digested 
by an enzyme which cut at the polylinker sites, as described 
in Methodology. The 20 kb and 9 kb vector arms of FIX II 
cloning system were used as internal size standards to 

20 convert relative sizes to absolute sizes. The results of 
fluorescence intensity and length were showed in Table 2. 
together with sizes from PFGE. Cases where the enzyme also 
cut the insert were easily interpreted. Scale bar is 5 
microns. a, clone F (Sal I): 20 kb, 7.5kb, 9.5 kb, 9kb. b 

25 Clone G (Sal I): 20 kb, lO.l kb, 4.1 kb, 9 kb. c, clone B ' 
(NotI): 20 kb, 17.6 kb, 9 kb. d, B3 (SstI) : 20 kb, 13.8 kb 
9 kb. 



FIG. 21 DNA binding properties of glass surfaces as a 
30 function of APTES deposition. Yeast (AB972) chromosome I 
molecules (240 kb, 72 mm contour length, assuming B-DNA) in 
(10 mM Tris pH 7 . 6 , 1 niM EDTA, 50 mM NaCl) were applied in 
molten agarose to glass surfaces previously treated with 
APTES for the indicated time. The number and length of 
35 molecules was measured by fluorescence microscopy after 

staining with ethidium homodimer. The plot shows the average 
number of molecules deposited per 100 field viewed (square) 
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and the average molecule length (circle) , plotted against the 
time of prior APTES derivatization. Each point represents 
-60 molecules imaged. Bars indicate the standard deviation 
about the means. Sample preparation, imaging techniques and 
5 analysis are given in Materials and Methods. 

FIG. 22. Optical mapping sizing results for NotI 
endonuclease restriction fragments of cerevisiap 
chromosomes I, v, vili, and XI calculated as described 

10 (Example 13) plotted against published results (Link and 
Olson, 191, Genetics 127:681). The diagonal line is for 
reference. Each point represents 20 to 4 0 imaged fragments 
Inset: estimate of population standard deviation (kb) . Error 
bars represent the 90% confidence intervals. (a) Relative 

15 apparent length determination of restriction fragment sizes. 
(B) Relative fluorescence intensity determination of 
restriction fragment sizes. 

FIG. 23. Typical fluorescence micrographs of S 
20 cerevisiae chromosomal DNA molecules digested with NotI 
restriction endonuclease. Molecules were stained with 
ethidium homodimer after digestion. Arrows indicate cleavage 
sites, bars 10 microns. A, chromosome XI, two cuts; B 
chromosome V, three cuts ; and C, chromosome VIII, two cuts 
25 D, araphical comparison of optical mapping results and 

published PPGE restriction maps of yeast chromosomes digested 
with Notl. Bar lengths for the optical mapping data are 
proportional to the means based on the fluorescence intensity 
measurements plotted in Fig. 22. 



30 



FIG. 24. Typical fluorescence micrographs of yeast 
artificial chromosomes digested with NotI, Mlul, EagI and 
Nrul restriction endonucleases and stained with ethidium 
homodimer. Arrows indicate cleavage sites, bars 10 microns 
35 YAC 7H6 was digested with: A, Nrul; B EagI . YAC 314 was 
digested with: C, NotI; D, Mlul; E, EagI; F, NotI and Mlul ■ 
G, Mlul and EagI. Graphical comparison of optical mapping 
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results with PFGE mapping results for YACs: H, 7H6; I, 314. 
Double digestion results are included. Bar lengths for the 
optical mapping data are proportional to the means based on 
fluorescence intensity measurements. 

5 

FIG. 25 is a diagram depicting a laminar flow elongation 
device . 

FIG. 26 A, B, and C illustrate the characteristic 
10 "sunburst" pattern of fixation of elongated molecules using 
the spotting technique of the present invention. 

FIG. 27 A and B show relaxation measurements as a 
function of molecular size. 

15 

FIG. 28 A and B are logarithmic plots of relaxation 
versus size. 

FIG. 29 shows a enlarged view of a DNA spot and one 
20 method of spreading molecules onto a derivatized surface. 

FIG. 3 0 is a block diagram of a method for high 
throughput optical mapping of lambda or cosmid clones. 

25 FIG. 31 is a block diagram of the system used for high 

throughput optical mapping of gridded YAC DNA. 

FIG. 32 is a block diagram of one embodiment of the 
automated system for high throughput optical mapping. 

30 

FIG. 33 illustrates a method of optimizing the image 
collection process and maximizing the signal -co-noise ratio. 

FIG. 34 is a block diagram of the image processing 
35 method in accordance with a preferred embodiment of the 
present invention. 
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^' DETAILED DESCRIPT TON OP THE INVEMTIQW 

Described herein are methods and compositions for 
characterizing and manipulating individual nucleic acid 
molecules, including mammalian chromosome -sized individual 
5 nucleic acid molecules. The methods and compositions 
described herein can be utilized for optical mapping and 
optical sequencing purposes to generate accurate, rapid, high 
throughput analyses of nucleic acid molecules at the genome 
level . 

10 Specifically. Section 5.1 describes methods for the 

elongation and fixation of single nucleic acid molecules. 
Such methods include both agarose-based (Section 5.1.1) and 
solid surface-based (Section 5.1.2) techniques. Section 5 1 
also describes assays for the optimization of parameters 
15 important to the production of the solid, planar surfaces 
used herein. Further, Section 5.1 also describes flow-based 
elongation techniques (Section 5.1.3) in which a single 
nucleic acid molecule is elongated, manipulated and/or 
analyzed in a laminar flow elongation device. 
20 section 5.2 describes methods for the imaging and sizing 

of single nucleic acid molecules. The Section includes, for 
example, nucleic acid staining, microscopy and photography 
techniques (Section 5.2.1) useful for imaging single nucleic 
acid molecules. Further, the Section describes methods for 
25 the sizing of single nucleic acid molecules including both 
static and dynamic measurement techniques (Section 5.2.2). 
Section 5.3 describes genome analysis applications to which 
the single nucleic acid molecule techniques of the invention 
may be put. Such applications include, for example, optical 
30 mapping and optical sequencing techniques. Finally, Section 
5.4 discusses methods for rapid, high throughput utilization 
of the single nucleic acid techniques of the invention. 



35 
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5.1. SINGLE NUCLEIC ACID MOLECULE ELONGATION TECHNIQUES 

A variety of methods can be utilized for the rapid, 
controllable and reproducible elongation of single nucleic 
5 acid molecules in such a manner that allows rapid, efficient 
analysis and/or manipulation of the molecules. These 
techniques can include, for example, gel -based (Section 
5.1.1), solid surface-based (Section 5.1.2) and flow-based 
techniques (Section 5.1.3), each of which will be separately 
described below. 

5-1. 1. GEL-BASEn TECHNTDTn^ g 

Gel -based techniques can be utilized for the elongation 
of single nucleic acid molecules. The gel -based techniques 

15 described herein maintain the biological function of the 
nucleic acid molecules and, further, allow for the 
manipulation and/or accurate analysis of the elongated single 
nucleic acid molecules. Nucleic acid molecules which can be 
rapidly, efficiently analyzed via such gel -based techniary 

20 include nucleic acid molecules which range in length from 
about 20 kb up to mammalian chromosome -si zed lengths ( i.e. . 
greater than lOOO kb) . Further, such gel-based techniques' 
make possible the utilization of dynamic measurement 
procedures, may generate a lower level of nucleic acid 

25 shearing and make possible the utilization of a wide range of 
biochemical activities with which the manipulate the 
elongated nucleic acid molecules. 

Briefly, gel -based techniques involve elongating single 
nucleic acid molecules within a molten or nonpolymerized gel 

30 composition such that upon cooling or polymerization, the 
elongated nucleic acid molecules are maintained in a 
relatively stationary position, while remaining accessible 
to, for example, enzymatic manipulation and/or hybridization 
to complementary nucleic acid molecules or binding to 

35 sequence-specific proteins or peptides. Further, the 

gelation process restrains elongated nucleic acid molecules 
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from appreciably relaxing to a random coil conformation 
after, for example, their enzymatic cleavage. 

For optimal imaging and manipulation potential, the 
amount which the single nucleic acid molecules are elongated 
5 wxthin the gel composition is critical. Excessive elongation 
or stretching causes the molecule to become difficult to 
visualize. For example, too much stretching presents too 
little fluorochrome per imaging pixel, lending the 
intensities generated by the measured molecular intensities 
10 to approach background values. Insufficient stretching 
however, generates too low a level of tension, which can 
interfere with an analysis of single nucleic acid molecule 
manipulations. For example, when restriction mapping, enough 
elongation must occur such that, upon digestion, the newly 
15 formed nucleic acid fragments pull away from each other, thus 
revealing restriction sites. An additional requirement for 
optimal gel-based elongation requires that care be taken to 
preserve the moisture within the gel, such that the maximum 
biological function of the nucleic acid can be retained. 
20 For optimal imaging/manipulation potential, the extent 

to which a nucleic acid molecule is elongated within a gel 
must be great enough to generate a sufficient level of 
intramolecular tension while not being so great that the 
elongated molecule becomes difficult to image. In general 
25 elongation methods which produce single nucleic acid 
molecules that span approximately 20% to 60% of their 
curvilnear contour lengths are preferred. 

Further, the elongated nucleic acid molecules within the 
gel must lie within a shallow plane of focus for successful 
30 imaging. With respect to larger nucleic acid molecules, for 
example, it is additionally important for the molecules to 
lie within a plane approximately 0.2 Mm in thickness for 
focused visualization. 

Because gelation or polymerization fixes embedded 
35 molecules, systematically varying parameters which affect the 
rate at which the gelation or polymerization occurs can 
modulate the degree of fixation and, ultimately, the rate of 
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molecule relaxation. Smaller nucleic acid molecules ( i.e. . 
molecules less than about 350 kb) relax quickly. Thus, it' is 
preferred that elongation take place under conditions which 
hasten gelation/polymerization so that the nucleic acid 
5 molecules become trapped in an extended conformation before 
substantial relaxation takes place. Larger nucleic acid 
molecules relax at a slower rate, and, therefore, can be 
elongated under conditions which allow for a slower rate of 
gelation/polymerization . 
• 10 With respect to agarose gels, parameters which affect 

the rate of gelation include, for example, the gel 
concentration and/or temperature at which the gel is formed. 
A higher gel concentration or gelation at a low temperature 
hastens gel formation. With respect to polyacrylamide gels, 
15 parameters which affect the rate of polymerization include, 
for example, the acrylamide/bisacrylamide concentration and 
ratio, the temperature at which polymerization takes place, 
and the ammonium sulfate and TEMED concentrations used. 
While any gel composition may be used for such 
20 elongation techniques, an agarose gel composition is 
preferred, with an agarose composition exhibiting a low 
gelling temperature being especially preferred. Such low 
gelling temperature agarose compositions are the most 
optically clear agarose compositions available and, further, 
25 because such compositions can remain molten at 37°C, the 
biological activity of enzymes, such as restriction enzymes, 
within the molten agarose can easily be maintained. 
Additionally, such agarose compositions are useful in that 
rapid gelation is often desired for fixation of the elongated 
30 nucleic acid molecules. For agarose gel compositions, a gel 
composition comprising from about 0.1% to about 3.0%, with 
0.1-1.5% being preferred. 

Any number of techniques can be used to apply an 
external force which will cause the nucleic acid molecules 
35 within the gel composition to become elongated. For example, 
an elongating external force may include an electrical or 
mechanical force. While the exact amount of external force 
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required for optimal elongation may vary according to, for 
example, the specific gel composition and nucleic acid 
molecules being elongated, the optimization of gel parameters 
can easily and without undue experimentation be assayed by 
5 for example, utilizing the visualization and measurement 
techniques described in Section 5.2, below. 

Elongation may, for example, be accomplished by 
generating a flow force within a molten agarose gel 
containing single nucleic acid molecules. Such a flow force 
10 may be set up by placing the nucleic acid/molten gel 

composition between two solid surfaces, such as, for example, 
between a slide and a coverslip. m such an embodiment, a 
hole preferably exists in the slide through which reagents 
for the manipulation of the elongated nucleic acid molecules 
15 can be introduced into the gel. Alternatively, molecules may 
be elongated by pressing the nucleic acid/molten gel 
composition under, for example, a teflon stamp, as described 
m Section 5.4, below. 

An electrical force may, additionally, be generated via 
20 any standard electrophoretic method, including, for example, 
pulsed field (U.S. Patent No. 4,695,548) and pulsed oriented 
(POE) electrophoresis. When utilizing electrophoretic 
techniques, devices which are suitable for visualization by 
microscopy techniques are preferred. One such embodiment is 
25 the miniature POE device shown in FIGS. 1 and 2 and in 
Example 4, below. 

POE improves separation of polydisperse polymer 
molecules in a sample by using short electric pulses to 
create and vary field angles, with the effective field angle 

30 being defined by the vector sum of a series of pulses which 
may vary in duration, intensity and direction. Pulse times 
and pulse intensities are modulated to effect separation 
POE is also useful for creating effective field angles during 
imaging. The needed instrumentation is readily adapted to 

35 the microscope. 

An exemplary laboratory instrument for POE is illus- 
trated in FIG. 1 and a schematic view is shown in FIG. 2. 
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The instrument exemplified in FIG. l is similar to a 
miniature version of that described in U.S. Patent No. 
4,473,452, but differs in that the POE instrument has two 
sets of diodes 34 which enable bipolar operation of the 
5 discrete electrode array. The diodes 34 can be replaced by a 
multiganged relay (not shown) to provide similar electrical 
isolation. However, it is best to use the diodes 34 when 
very fast (less than 1 second) pulsing is needed. 

As depicted in FIGS. 1 and 2, the miniature electropho- 
10 resis chamber 50 used in this invention mea'sures about the 
size of a standard coverslip. it has electrodes 42', which 
are connected to diodes 34 (FIG. 2) . in order to generate 
the desired electric fields, platinum electrodes 42' are 
interconnected as shown in FIG. 2. In particular, d-c power 
15 supply 28 supplies d-c power to relays 30, which are 

controlled by a computer 32 to connect selected outputs to 
the d-c power from power supply 28. Computer 32 also 
controls d-c power supply 28 so that the potential of the 
power supply can be varied. Outputs to relays 30 are 
20 connected to electrodes 42' through respective diodes 34 for 
each electrode. 

As shown in FIG. 1, the miniature POE apparatus has a 
holder 52, which fits on a microscope stage. A slide 54, 
which holds an agarose gel, is placed into the holder and the 

25 electrodes 42 make electrical contact with the 

slide/gel/cover-slip sandwich placing drops of 30% glycerol- 
agarose at the agarose electrical connecting wicks 44. The 
glycerol prevents drying out of the gel. The electrical 
connector 46, which is part of the holder 52, provides a link 

30 to the bipolar diodes 34 and pulsing instrumentation shown in 
FIG. 2. 

As in the case of the instrument described in U.S. 
Patent No. 4,473,452, the presently exemplified instrument 
generates electrical fields which are orthogonal to each 
35 other, which alternate between high and low intensities out 
of phase with each other according to the chosen pulsing 
routine as described below and which translate the molecules 
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undergoing separation incrementally through the gel matrix in 
an overall direction transverse to the respective directions 
of the generated electrical fields. Due to the novel bipolar 
nature of the electrode design, it is possible to change 
5 polarities, simultaneously if desired, in addition to 

alternating high and low intensities without any significant 
electrode induced field distortions. 

The determination of effective field angle by a pulsing 
routine rather than by placement of an electrode array 
10 permits molecular orientations (and separations) that would 
otherwise be difficult. As described in Example 4 below, POE 
has been used in DNA imaging experiments. The 
electrophoresis apparatus pictured in FIGS, l and 2 and used 
m Example 4 may be preferred over that of U.S. Patent No. 
4,695,548 because varying the field angle by moving 
electrodes as taught by conventional pulsed field 
electrophoresis is not practical due to microscope stage 
physical constraints. 

As described above, gel -based techniques can 
20 successfully analyze single nucleic acid molecules ranging in 
size from approximately 20 kb up to chromosome -sized (i^ 
greater than lOOO kb, . Thus, techniques for the preparat;;n 
of the single nucleic acid molecules to be elongated should 
be chosen which avoid excessive shearing. Such techniques 
25 are well known to those of skill in the art and may include 
for example, techniques such as those described below 

First, agarose-embedded cell lysate techniques, such as 
those described in U.S. Patent No. 4,695.548, for preparing 
large DNA molecules without breakage can be adapted for use 
30 with the gel -based elongation techniques of the present 
invention. For example, cells may be washed, mixed with 
molten low melt agarose, which is then allowed to harden 
The resulting block is then placed into a lysis solution' 

35 ITTT^ '''''''' detergent, which diffuses into 

35 block, lysing the cells and rendering intact naked DNA 

molecules stripped of their associated proteins. The absence 
of physical manipulation keeps the DNA essentially intact 
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The agarose can then be melted and subjected to external 
elongating forces such as those described above. 
Alternatively, chromosomal DNA can first be resolved into 
chromosomal populations via standard methods such as, for 
5 example, pulsed field electrophoresis. The resolved DNA 
populations which may, for example, consist of populations of 
copies of the same chromosome, can then be subjected to the 
gel -based elongation methods described above. 

Additionally, a condensation agent may be used to 
10 collapse gel-bound nucleic acid molecules into small, shear- 
resistant balls, that can be unfolded with the addition of an 
ionic compound, such as, for example, sodium chloride or 
magnesium chloride, when appropriate. Preferably, the 
condensation agent is spermine. The spermine protocol, which 
15 is described further in Example 10, permits the mounting of 
extremely long DNA molecules with no detectable shear- 
mediated breakage. Nucleic acid molecules of extremely long 
length (i^, about 5.6 Mb) have been successfully condensed 
by such a technique with no appreciable shearing. In fact, 
20 it is conceivable that any size of nucleic acid can be 

inserted into a gel with no substantial shearing. While the 
use of spermine is preferred, other suitable materials for 
collapsing such nucleic acid molecules include any material 
which can cause a particular nucleic acid molecule to 
25 collapse, e.g., any condensation agent which causes nucleic 
acid molecules to preferentially solvate themselves. 
Additional examples of such materials include, but are not 
limited to, spermidine, alcohol and hexamine cobalt. 
Spermine -condensed DNA can be added to molten agarose, 
30 decondensed, and elongated according to the techniques 

described herein. Further, large nucleic acid molecules may 
initially be separated electrophoretically using, for example 
standard pulsed field electrophoresis techniques. The 
portion of the gel containing the separated molecules of 
35 interest may then be excised. 

The excised portion of the gel can then be used as part 
of the gel-based techniques of this Section. Additionally, 
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nucleic acid molecules in solution can be gently mixed with a 
molten agarose solution and utilized as part of the 
techniques of this Section. 

Once single nucleic acid molecules have been 
5 satisfactorily -elongated and fixed within the gel 

compositions as discussed herein, any of the analysis and/or 
manipulation techniques described in Section 5.3, below may 
routinely be utilized. 

5.1.2. SOLID Slip FACE -BA.qp.n TECHNTQ TTCg 
Solid surface-based techniques can be utilized for the 
rapid, controllable and reproducible elongation and fixation 
of single nucleic acid molecules, as described in this 
section. Upon elongation and fixation of the single nucleic 
15 acid molecules onto the solid surfaces as described herein, 
any of the analysis and/or manipulation techniques discussed 
below, in Section 5.3, may easily be performed. 

Such solid surface-based elongation/fixation techniques 
yield a number of advantages for single nucleic acid 
20 analysis/manipulation applications. For example, the nucleic 
acid molecule images are very sharp and bright. This is due 
m part, to the absence of gel-based image scattering, and to 
less extraneous fluorescence background in the field. 
Additionally, fixation techniques can be more precisely 
25 controlled and may, for example, be made somewhat tighter 
than those described, above, in Section S.l.i, for gel-based 
techniques. Thus, the solid surface -based techniques 
described herein make possible the rapid generation of high 
resolution nucleic acid analysis information from single 
30 nucleic acid molecules, including single nucleic acid 

molecules of much shorter lengths than currently available 
using the gel-based techniques described, above, in Section 
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A wide size range of nucleic acid molecules,!^, from 
35 about 300 bp to mammalian chromosome -size (that is greater 
than 1000 kb) can efficiently be elongated and stably fixed 
onto the solid surfaces described herein. These techniques 
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feature gentle fixation approaches which maintain the 
biological function of the nucleic acid molecules being 
elongated and, further, allow for the manipulation and/or 
accurate analysis of the elongated single nucleic acid 
5 molecules. Additionally, the solid surface-based techniques 
described herein make possible the storage and reuse of the 
elongated nucleic acid molecules. Further, such solid 
surface -based techniques described herein can easily be 
adapted for high throughput methods, as described in Section 
10 5.4, below. 

The elongation procedures described in this Section 
utilize solid surfaces which exhibit a positive charge 
density, as described, below, in Section 5.1.2.2, below. As 
discussed, below, in Section 5.1.2.1, however, the density of 
15 the solid surface positive charge must be optimized to 

achieve a balance between elongation, relaxation, stability 
and biological activity parameters, 

5.1.2.1. SOLID SURFACE OPTIMIZATION 
20 Unlike instances in the past in which nucleic acid 

molecules were attached to solid surfaces, the controlled, 
reproducible solid surface elongation/fixation techniques 
described herein utilize surfaces, especially glass surfaces, 
which reproducibly elongate and fix single nucleic acid 
25 molecules. As discussed in greater detail, below, in Section 
5.1.2.2, the surfaces described herein exhibit a positive 
charge density. Several parameters must be taken into 
account, however, in order to optimize the solid surface 
charge density such that, for example, the genome analysis 
30 techniques described, below, in Section 5.3, can be 
performed . 

The solid surfaces of the invention should exhibit a 
positive charge density which achieves an optimal balance 
between several parameters, including elongation, relaxation, 
35 stability and biological activity. Assays are described in 
this Section which make surface optimization possible. 
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First, the solid surface must allow the molecule to be 
as completely elongated as possible, while allowing for a 
small degree of relaxation. As used herein, "small degree of 
relaxation" refers to a level of relaxation which yields a 
5 gap of between about 0.5 microns and about 5.0 microns when 
the elongated nucleic acid molecule is cut. An optimal 
balance between these two parameters yields improved imaging 
capability. For example, an efficient balance between 
elongation and relaxation capability facilitates the imaging 
10 of newly formed, growing gaps as they develop at restriction 
enzyme cleavage sites. 

In addition to elongation and' relaxation, the biological 
activity retained by the elongated nucleic acid molecule must 
be taken into account when optimizing the positive charge 
15 density of the elongation/fixation solid surface. Further, 
the stability of the elongated nucleic acid molecules on the 
surface must be considered. In the case of a restriction 
digest (L^, as part of an optical mapping procedure) , 
"stability" refers to how well the restriction fragments 
20 formed are retained on the solid surface. 

As a first step toward determining the positive charge 
density which represents an optimal balance between each of 
these parameters, the positive charge density ( e.g. . the 
level of surface derivatization; see Section 5.1.2.2, below) 
25 may be titrated against the measured average molecular length 
of the nucleic acid molecules which are deposited on the 
surface. Molecule counts (i^, the number of countable 
molecules which have been deposited) on the surface can also 
be measured. 

30 At low levels of positive charge density ( e.g. . 

derivatization) , the average molecular extension on the 
surface is low. This may be due to the fact that, at this 
charge concentration, not enough nucleic acid binding sites 
exist to hold an extended molecule with stability. As the 

35 positive charge density {^^, the level of derivatization) 
increases, the average nucleic acid molecular extension also 
increases, eventually peaking. As the positive charge 
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density (g^^, the amount of derivatization) continues to 
further increase, the average amount of molecular extension 
then begins to decrease. This may be due to the presence of 
such an abundance of nucleic acid binding sites that any flow 
5 forces which are present and would drive elongation are 
overwhelmed and, therefore, molecular extension is, to some 
ext ent , quenched . 

Once a positive charge density ( e.g. . a derivatization 
level) is achieved which affords maximum nucleic acid 

10 molecule extension, the elongation parameters must be tested 
within the context of the specific imaging or analysis 
procedure for which the single molecules are to be used. 
Such testing involves an evaluation of the biological 
activity of the nucleic acid molecule as well as a 

15 determination of the relaxation level of the elongation 
nucleic acid. For example, in instances whereby the 
elongated nucleic acid molecules are to be used for optical 
restriction mapping, the level of elongation/fixation must 
allow for cutting by the restriction enzyme as well as 

20 providing a level of relaxation which makes possible the 
ready imaging of nascent restriction enzyme cleavage sites. 

In the case of optical mapping, one such test would 
include the digestion of the elongated nucleic acid molecule 
and a determination of, first, the enzyme's cutting 

25 efficiency, and, second, a measurement of the size of the 
nascent gap formed at the new cleavage sites (thus measuring 
relaxation) . A cutting efficiency of at least about 50% is 
an acceptable level of biological activity retention. 
Acceptable relaxation levels are as described above. 

30 Further, the stability of the elongated nucleic acid 

molecule must be ascertained. As discussed above, in the 
case of optical mapping, stability refers to the retention 
level of newly formed restriction fragments on the surface. 
For optical mapping, an acceptable stability level is one in 

35 which at least about 80% of the newly formed restriction 
fragments . 
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5.1.2.2. SOLID SURFACE POSITIVE CHARGE DENSITY 
solid planar surfaces may be prepared for optimal 
elongation and fixation of single nucleic acid molecules via 
a variety of simple manipulations. First, for example, the 
5 surfaces may be derivatized to yield a positive charge 
density, which can be optimized by utilizing the assays 
described in Section 5.1.2.1, above. Additionally, simple 
manipulations may be performed to reversibly modulate the 
surface positive charge density to more precisely optimize 
10 surface charge density at each step of the nucleic acid 
elongation, fixation analysis and/or manipulation steps 
such reversible charge density modulation is referred to 
herein as "facultative fixation", as discussed below. Third 
additional methods for further affecting the 
15 elongation/fixation of the single nucleic acid molecules are 
discussed. These include, for example, methods for 
controlled drying, for the generation of gradients of 
positive charge density and for crosslinking of the elongated 
nucleic acid molecules. 

20 

5.1.2.2.1. Surface Derivatization 

surfaces may be derivatized using any procedure which 
creates a positive charge density which, presumably, favors 
an interaction with a nucleic acid molecule. Any compound 

25 which absorbs to or covalently binds the surface of interest 
and further, introduces a positive charge density onto the 
surface can be utilized as a derivatizing agent. Such 
compounds should not, preferably fluoresce. For example 
surfaces may be derivatized with amino moiety-containing' 

30 compounds that absorb to or covalently bind the surface of 
interest. Such amino- containing compounds can, for example 
include amino-containing silane compounds, which are capable 
Of covalently binding to surfaces such as glass. Among these 
amino-containing silane compounds are 3- 
35 aminopropyltriethoxysilane (aptes) 3 -methylaminosilane 

APTES can be useful in that it may be crosslinked, while the 
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use of 3-methylaminosilane may, in certain instances, be 
advantageous in that the compound resists oxidation. 

Among those derivatizing agents which non-covalently 
absorb to surfaces, such as glass surfaces, may, for example, 
5 be derivatized with poly-D-lysine (polylysine) . Polylysine ' 
binds glass via electrostatic interactions. Polylysine may 
be especially advantageous for pressure-based elongation 
techniques (see Section 5.1.2.3, below), when utilizing 
polylysine as a derivatizing agent, the size of the polymeric 
10 polylysine is to be taken into account. For example, low 

molecular weight polylysine (e^, mw less than 200,000; with 
about 90,0000 being preferred) appears to fix elongated 
nucleic acids more tightly than high molecular weight 
polylysine (g^, mw greater than 200,000, with 500,000 being 
15 preferred) . Thus, when elongating and fixating on a solid 
surface which having polylysine, a low molecular weight 
polylysine would be preferred for tighter fixation, for 
the fixation of smaller nucleic acid fragments. 

Surface derivatization may be achieved by utilizing 
20 simple, reproducible techniques. When derivatizing a surface 
with APTES, for example, a clean surface, such as a glass 
surface, may be incubated in an acidic APTES solution for a 
given period of time. Increasing the incubation time will 
increase the resulting charge density of the surface. It is 
25 preferred that conditions should be chosen such that the 

single nucleic acid molecules are elongated to approximately 
50-100% of their polymer contour length. 

In one embodiment of such an APTES derivatization 
procedure, a clean glass surface can be incubated for an 
30 appropriate period of time in an APTES concentration of about 
0.10 M, pH 3.5 at a temperature of about 65° C. Incubation 
times for such an embodiment can range from about 3 to about 
18 hours. In order to stop the derivatization process, the 
surfaces need only be removed from the APTES solution and 
35 repeatedly rinsed in highly pure water. The clean, 
derivatized surfaces are then air dried. 
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With respect to derivatizing a surface with polylysine, 
a clean surface, such as a glass surface, can be derivatized 
in a polylysine solution. The concentration and molecular 
weight of the polylysine used for derivatization affect the 
5 level of derivatization achieved per incubation time. 
Increasing the polylysine concentration increases the 
resulting surface charge density which forms. For optical 
mapping purposes, conditions should be chosen such that 
single nucleic acid molecules are extended up to about 100% 
10 of their polymer contour length. 

In one embodiment of such a polylysine derivatization 
method, a clean glass surface can be incubated overnight, at 
room temperature, in a solution of polylysine having a 
molecular weight of about 350,000, at a concentration of 
15 about 10- to 10- grams per milliliter. After incubation, the 
derivatized glass surface is rinsed in highly pure water and 
either air dried or wiped dry with lens tissue paper. Such 
conditions are expected to achieve nucleic acid elongation 
levels which are suitable for, say, optical restriction 
20 mapping. 

In addition to methods which involve the use of a 
derivatizing agent such as described above, a positive charge 
density may be introduced onto a surface by a number of 
alternate means. Such a positive charge density may, for 
25 example successfully be applied to a surface via plasma 
derivatization, an electrostatic generator (to create 
electrical charge) or corona discharge, just to name a few. 

5.1.2.2.2. Facultative Fixation 
30 Described herein are methods for the reversible 

modulation of solid surface positive charge density. Such 
methods are designed to optimize solid surface charge density 
at each step of the elongation, fixation and 

analysis/manipulation steps described herein. Among the ways 
35 by which such a reversible charge density can be effected 
include changes in the salt concentration, divalent cation 
concentration, effective water concentration, and/or pH. 
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Using facultative fixation, the surface positive charge 
density can be tailored to suit each step of the single 
nucleic acid techniques described herein. For example, it 
may be desirable to fix the nucleic acid molecule under 
5 reversible conditions which favor a loose charge density, 
leading to a higher degree of nucleic acid molecule 
spreading. The charge density may then, for example, be 
increased for a restriction digest step. Additionally, it 
may be desirable to digest a molecule so tightly fixed that 
10 no relaxation gaps form upon cleavage and then to 

subsequently lower the charge density such that the gaps are 
allowed to form. Finally, a very high charge density may 
then be chosen if the sample is to be stored (i^, such that 
the newly formed restriction fragments do not detach from the 
15 surface during storage) . 

With respect to salt concentration, as the salt 
concentration the surface finds itself in increases ( e.g. . 
from 0 to 5M NaCl) , the surface positive charge density 
decreases. With respect to divalent cation ( e.g. . Mg*', Ca^-) 
20 concentration, as the divalent cation concentration in the 
buffer surrounding the surface increases ( e.g. . imM to IM) , 
the surface positive charge density decreases. As the 
effective water concentration is decreased, due to the 
addition of an increasing concentration of non-aqueous 
25 material, the surface positive charge density increases. 

Changing the pH represents a gentle and fast method to 
reversibly modulate the charge density of a surface. A low 
pH promotes positively charged environment, while a high pH 
promotes a less positively charged, more neutral environment. 
30 Taking, as an example, a surface which has been 

derivatized using an amino- containing group, an aminosilane 
compound, for example, a pH of approximately 6 yields a 
positive charge density. Raising the pH lowers the charge 
density until the charge is essentially neutral at a pH of 9- 
35 10. A variety of simple methods may be utilized to produce 
pH-based facultative fixation. For example, the surface can 
be exposed to buffers, such as Tris or phosphate buffers, of 
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varying pH. Additionally, gas-induced pH changes can be 
made. For example, CO, gas can be introduced over the buffer 
in which the derivatized surface is submerged such that the 
buffer is acidified, thereby increasing the overall charge 
5 density on the surface. Alternatively ammonia gas, for 
example, may be introduced over the buffer,, raising the 
buffer pH, thereby lowering the overall surface charge 
density. These latter gas-based techniques are especially 
useful in instances whereby it is essential to minimize 
10 possible physical disturbances on the solid surface in that 
the buffer remains undisturbed throughout the facultative 
fixation process. 

5.1.2.2.3. Other Positive Charge Density Methods 
15 Derivatiration gradients. m addition to a uniform 

controllable derivatization of an entire solid surface, it is 
also possible to reproducibly form a gradient of 
derivatization. Such a derivatization gradient can be formed 
by, for example, the use of drops of derivatizing agents 
20 deposited on the solid surface. Upon deposition, such a drop 
would form a meniscus, leading to a greater concentration of 
derivatizing agent available to the solid surface at the 
perimeter of the drop than within its interior section 
This, in turn, leads to a gradient of derivatization, with 
25 the outer portion of the solid surface where the drop had 
been exhibiting a higher level of derivatization than that 
within the interior. 

Such a gradient of derivatization promotes a higher 
percentage of fully elongated molecules. Further, due to the 
30 tension set up across the nucleic acid molecule, a more 
efficient level of aligning and packing is observed, thus 
maximizing the amount of usable molecules per imaging field, 
one goal of invention. 

35 Crosslinking. The single elongated nucleic acid 

molecules of the invention may, additionally, be crosslinked 
to the solid surface. Such crosslinking serves to 
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permanently fix the molecules to the surface, which can be 
advantageous for a variety of reasons. For example, 
crosslinking may be useful when working with very large 
nucleic acid molecules. Further, the surface properties of 
5 the solid may be modulated with no possibility of nucleic 
acid loss. Additionally, the possibility of unacceptable 
nucleic acid fragment loss or relaxation which could occur 
over the course of, for example, storage or a long reaction, 
would not exist with crosslinking. 
10 Crosslinking, as utilized herein, is to be performed in 

con:unction with the elongation/fixation techniques described 
xn these Sections. First, the desired level of elongation is 
determined and achieved, and subsequent to this, the 
elongated nucleic acid is crosslinked for permanent fixation. 
15 A number of crosslinking methods are available, 

including glutaraldehyde and UV crosslinking. Glutaraldehyde 
crosslinking may be performed using, for example, a 5 minute 
incubation in a lo mM glutaraldehye solution. UV 
crosslinking may be accomplished using, for example, a 
20 Stratalinker (Stratagene) crosslinker, following standard 
protocols. 



Controlled Drying. Additional compounds may be added to 
the aqueous solution by which the nucleic acids may be 

25 deposited onto the solid surfaces (see below for deposition 
techniques) which yield drying characteristics that promote 
the production of a greater percentage of fully elongated 
nucleic acid molecules and which exhibit a lower level of 
intermolecular overlap or tangling, both features of which 

30 are extremely useful for analysis purposes. 

Compounds which may be added for such a controlled 
drying aspect of the elongation methods include, but are not 
limited to glycerol, DMSO, alcohols, sucrose, neutral 
polymers such as Ficoll, and dextran sulfate. While their 

35 mechanism is not known, it is possible that these compounds 
promote a liquid crystalline state which promotes the above- 
described features. 
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Hydrophobic microwelle. Hydrophobic regions may be 
introduced onto portions of the solid surfaces which can 
serve as. essentially, "microwells". These hydrophobic 
regions create closed boundaries, which make possible the 
5 introduction of different reagents onto different portions of 
the solid surface, such that a number of different reactions 
may be performed simultaneously on the same solid surface. 

Prefixation techniques. The solid surfaces of the 
10 invention may be prefixed with agents, proteins for example, 
of interest, prior to the introduction of the nucleic acid 
molecules to be elongated. Proteins may be fixed onto the 
solid surfaces by routine means, such as crosslinking means, 
which are well known to the skilled artisan. Among the 
15 proteins which may be prefixed onto the solid surfaces of the 
invention are enzymes, such as restriction enzymes, which are 
used to manipulate nucleic acid molecules or any other 
nucleic acid-binding proteins. Thus, upon elongation of 
nucleic acid molecules onto the solid surfaces containing 
20 such prefixed enzymes and the addition of whatever additional 
agents, such as certain divalent ions, which are necessary 
for the enzymes to act upon nucleic acids, the single nucleic 
acid molecules can be manipulated, e.g. . cleaved at 
appropriate restriction sites. Using such a prefixation 
25 technique, a number of different reactions may be performed 
simultaneously on the same surface. 

5.1.2.3. SINGLE NUCLEIC ACID MOLECULE DEPOSITION 

As described above, a wide size range of nucleic acid 

30 molecules may be deposited onto the derivatized solid 
surfaces described herein. Specifically, nucleic acid 
molecules from about 300 base pairs to greater than 1000 kb 
can be analyzed using such solid surfaces. Smaller nucleic 
acid molecules, which are relatively shear resistant, can be 

35 isolated using standard nucleic acid purification techniques 
well known to those of skill in the art. These smaller 
nucleic acid molecules may be less than about 150 kb and, 
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generally, are less than about 20 kb. Larger nucleic acid 
molecules, which are subject to breakage by shearing events, 
can be isolated by utilizing, for example, the nucleic acid 
molecule isolation techniques described, above, in Section 
5 5,1. Such shear-sensitive nucleic acid molecules are 
generally greater than 150 kb, but may include molecules 
greater than about 20 kb. 

Larger nucleic acid molecules ( i.e. . those greater than 
about 90 kb) should, generally, be deposited onto the solid 
10 surfaces in a manner which minimizes breakage due to shear 
forces. Preferably, therefore, these larger nucleic acid 
molecules are deposited onto the surfaces in molten agarose. 
For example, molten agarose containing nucleic acid molecules 
can be spread onto surfaces under conditions which generate a 
15 flow force that facilitates elongation. In a preferred 
embodiment, drops or droplets of molten agarose containing 
nucleic acid molecules are deposited onto the surface. The 
force generated when the drop hits the surface is sufficient 
to provide the required elongation. Upon hardening, the 
20 agarose is scraped off the surface, leaving behind intact, 
elongated, fixed nucleic acid molecules. 

In instances in which smaller nucleic acid molecules 
( i.e. , ones ranging from about 300 bp to about 90 kb) are 
being deposited, the above gel techniques can be utilized. 
25 Further, the nucleic acid molecules can be deposited onto the 
surface in an aqueous solution. Elongation can then be 
achieved via various methods. For example, molecules can be 
sandwiched between two surfaces, one of which is the 
derivatized surface. In such a procedure, one of the two 
30 surfaces should contain a hole through which reagents may be 
introduced. Alternatively, the solution on the derivatized 
surface containing the nucleic acid molecules can be pressed 
with, for example, a teflon stamp. 

Preferably, however, the nucleic acid molecules 
35 deposited in such an aqueous fashion can be elongated by 
merely allowing the aqueous solution to dry. Thus, in the 
absence of any manipulations apart from simple deposition 
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onto a derivatized surface of the invention, single nucleic 
acid molecules can efficiently, successfully and rapidly 
generate stably elongated and fixed nucleic acid molecules 
suitable for imaging and/or further manipulation. As 
5 described, below, in Section 5.4, such a technique is 
especially suited to high throughput analysis techniques. 

5.1,3. FLOW -BASED TECHNIQUES 

The single nucleic acid molecules of the invention may 
10 be elongated manipulated and/or analyzed in flow-based 
techniques such as those described in this Section. Such 
techniques may be especially useful in instances whereby only 
low concentrations of the nucleic acid molecules of interest 
are available. 

15 Briefly, such a flow-based technique involves the 

introduction of a single nucleic acid molecule into a laminar 
flow elongation device. Gentle solvent flow fields are 
generated within the device which cause the nucleic acid 
molecules to be elongated without significant shearing. 

20 Further, as the elongated nucleic acid molecule flows through 
the laminar flow elongation device, it can be imaged via, for 
example an attached microscope and camera. Still further, 
the methods described herein make possible the controlled, 
regio-specif ic restriction digests of the elongated nucleic 

25 acid molecules which, coupled with the flow aspect of the 
device, makes possible the generation of real-time 
restriction maps. 

A preferred embodiment of such a laminar flow elongation 
device is illustrated in FIG. 25. Briefly, such a device, 

30 which is designed to liberate and elongate nucleic acid 
molecules out of gel inserts, comprises a laminar flow 
chamber to which are attached an extraction area and a 
viewing/manipulation area. While the device diagrammed in 
FIG. 25 depicts a single laminar flow chamber, a multiplexing 

35 laminar flow elongation device may also be utilized. Such a 
device may contain, for example, a branched laminar flow 
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chamber, such that multiple analyses of copies of identical 
single nucleic acids can be accomplished rapidly. 

The laminar flow chamber should contain a thin space, 
for example, a space generated via a 10-20 micron opening. 
5 The solvent flow generated within the chamber should be 
gentle enough to avoid significant shearing of the nucleic 
acid molecules. For example, one acceptable flow would be 
approximately 5 x IQ-^nl/sec at 100 x 20 micron opening. The 
fluid flow may be generated by a pumping means attached to 
10 the chamber upstream of the extraction and the 

viewing/manipulation areas or, alternatively, may be 
generated by a vacuum means attached to the chamber 
downstream of the extraction and the viewing/manipulation 
areas . 

15 The extraction chamber, through which the laminar flow 

chamber passes, serves to simultaneously liberate the nucleic 
acid from a gel insert and to move the nucleic acid into the 
flow of the device. Such an extraction chamber comprises 
electrodes which set up an electric field through which the 

20 nucleic acid moves out of the insert and into the flow of the 
laminar flow chamber. 

The viewing/manipulation chamber comprises a 
microscope/light source mounted chamber through which the 
laminar flow chamber passes. The microscope is preferably an 

25 epif luoresence microscope containing an oil immersion 

objective, to which is attached a camera, preferably a video 
camera. The elongated nucleic acid molecules can be 
visualized and, optionally, their images can be recorded, as 
the molecule passes through the viewing/manipulation chamber. 

3 0 In a preferred embodiment of such a procedure, the 

nucleic acid molecules are enzymatically manipulated as they 
pass through the viewing/manipulation chamber. Taking the 
case of optical mapping as an example, the elongated, flowing 
nucleic acid molecules can be digested with restriction 

35 enzymes as they pass through the viewing/manipulation 
chamber . 
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For example, the fluid in the laminar flow chamber can 
contain restriction enzymes and each of the reagents 
necessary for digesting the nucleic acid molecule flowing 
through the chamber, except that the divalent cation (usually 
5 Mg^*) which is necessary for enzyme activity is present in a 
reversibly chelated form. As such, the nucleic acid is 
protected from digestion until the divalent cations are 
liberated. By chelating the divalent cations with, for 
example, a light -inactivated chelator such as, for example, 

10 DM-nitrophen, as described below in Section 5.3, the cations 
can be released within the viewing/manipulation chamber as 
the fluid passes through the microscope light source. Thus, 
the nucleic acid molecule first becomes subject to digestion 
as it passes through the viewing/manipulation chamber. 

15 Further, as digestion occurs, the flow maintains the order of 
the resulting restriction fragments, which are imaged and 
which, therefore, instantly produce restriction maps which 
have been generated in real time. An example of such a 
photo- inactivated chelator is described, below, in Section 

20 5.3. 



25 
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5.2, SINGLE NUCLEIC ACID MOLECULE IMAGING AND SIZING 
TECHNIQUES 

In accordance with the present invention, elongated and 
5 possibly fixed single nucleic acid molecules can be imaged 
and sized using a number of different techniques to obtain 
estimates of various molecular parameters of interest. To 
this end, in a preferred embodiment of the present invention, 
molecules being imaged are first stained with f luorochromes 

10 which are absorbed by the molecules generally in proportion 
to their size. Digital images of the molecules are next 
generated for subsequent processing. In particular, the size 
of the imaged molecules can be determined from measurements 
of the fluorescent intensity of the molecule when it is 

15 illuminated with an appropriate light source, from 

measurements of the contour of the digitized molecule, or 
from the dynamics of molecular relaxation measured using a 
series of digitized images. The steps of imaging and sizing 
of nucleic acid molecules in accordance with different 

20 embodiments of the present invention are described in more 
detail next. 

5.2.1. IMAGING 

The first processing step in accordance with the present 

25 invention is to stain the molecules of interest in order to 

make them available for subsequent imaging. The following 

table summarizes f luorochromes used in a preferred embodiment 

of the present invention 

F luorochromes 
..A) DNA counter stains 
(PI) 
DAP I 

Hoechst 33258 
Quinacrine 
Chromomycin 

B) Hybridization 
35 site labels 

FITC 
TRITC 
XTRITC 

. 38 - 



for imaging purposes. 

Excitation max Emission max 

330 and 520 620 

350 460 

360 470 

455 495 

430 470 



490 
554 
580 



520 
573 
600 
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TR 596 620 

AMCA 350 450 

CY5= 646 663 

In accordance with the present invention, molecules can 
5 be imaged using fluorescent beads and chemiluminescent 
tagging employing alkaline phosphatase. Generally, single 
fluorescent beads including the smallest ones with a diameter 
of just 0.01 microns are easily imaged with fluorescence 
microscopy. Fluorescent beads provide a reliable way to 

10 label single DNA molecules for image processing purposes 
because individual beads are intensely fluorescent, 
morphologically distinctive, available in wide range of 
fluorochromes of differing spectral qualities, and are easily 
attached to oligonucleotides. (It should be noted, however, 

15 that in practice, if the Rayleigh limit is exceeded a bead 
would appear as a bright spot which is inadequate for image 
processing) . Latex beads are sold, for example, by Molecular 
Probes, Inc., and are available with coatings of carboxylate, 
avidin or streptavidin in six spectral ranges (colors) and 

20 sizes varying from 0.01 to 2 microns. Carboxylate modified 
and streptavidin coated beads provide a number of 
alternatives for binding the beads to DNA molecules. 

Furthermore, synthesizing oligonucleotides can be 
covalently attached to a series of differently sized 

25 fluorescent beads (0.01-0.05 microns) to optimize RARE 
conditions. (RARE stands for RecA-assisted restriction 
endonuclease . The RARE technique is described in more detail 
in Section 5.3.1. Briefly, this technique involves the 
generation of restriction endonuclease cleavage events that 

30 occur solely within the specific hybridization product) . 
Smaller beads are generally preferable because they diffuse 
more readily through agarose gel. On the other hand, larger 
beads are easier to derivatize due to their larger surface 
area. Fluorescent beads of similar size can be imaged 
35 electrophoresing through gels by fluorescence. Forming RecA 
filaments using these modified oligonucleotides and assaying 
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their formation by functionality in a RARE test system can 
also be used for imaging purposes. 

5.2.1.1. CHEMILUMINESCENT DETECTION 
5 Chemiluminescent labeling of oligonucleotides for non- 

isotopic detection in Southern blots (as well as other 
techniques) is a popular labeling technique especially 
because of its high sensitivity. In general, the method 
consists of attaching alkaline phosphatase to 

10 oligonucleotides (using commercially available systems) , and 
then hybridizing to target DNA. Following formation of 
hybrids, a chemiluminescent substrate is added, usually 1,2 
dioxetane, which rapidly decomposes into a chemiluminescence 
generating compound. Light is next emitted with a maximum at 

15 470 nm and a half life of 2-30 minutes, depending upon the 
chemical environment. 

Given the high sensitivity of the method and the 
availability of high quality commercial kits, 
chemiluminescence can be used in accordance with the present 

20 invention to optically detect RARE on single DNA molecules. 
For example, using commercially available kits, alkaline 
phosphatase can be covalently linked to oligonucleotides or 
DNA can be linked a to biotin-streptavidin attachment. The 
conjugated oligonucleotides are then formed into RecA 

25 filamer^xs and tested for RARE effectiveness. Excess 

biotinylated alkaline phosphatase can be easily dialyzed out 
of the system to reduce stray chemiluminescence using biotin- 
streptavidin mediated alkaline phosphatase linkages. A 
chemiluminescent detection system can be used with RARE and 

30 basic optical mapping. The RecA-oligonucleotide (linked to 
alkaline phosphatase) -target DNA complex is placed in molten 
agarose gel and then mounted for optical mapping. Instead of 
using magnesium ions in to trigger enzymatic cleavage, 
dioxetane is diffused in a preferred embodiment, as required 

35 by the chemiluminescence system for visualization of RARE 

sites. The chemiluminescence activity can then be visualized 
through a microscope using an ICCD camera, with no 
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illumination necessary. To image the entire molecule, DNA- 
fluorochrome fluorescence and different f luorochromes can be 
used if the initial compounds quench or interfere with 
chemiluminescence . 

5 

5.2,1.2. IMAGED ENERGY TRANSFER 

In accordance with the present invention an alternative 
approach to molecular imaging is to use energy transfer 
between the fluorochrome- labeled DNA and a bead attached to 

10 the oligonucleotide. Excitation can be selected to make the 
DNA- fluorochrome complex the donor and the bead the acceptor. 
In this case the bead could fluoresce only when it is within 
100 angstroms or less of the donor, however, the efficiency 
of energy transfer falls off rapidly with distance. Energy 

15 transfer imaging using fluorescence microscopy with different 
microscope filter combinations allows visualization of the 
donor, acceptor, and the donor-acceptor pair; these are 
conveniently slid in and out of the illumination path. In 
accordance with a preferred embodiment of the present 

20 invention, a good energy transfer donor to use is ethidium 
bromide or homodimer, since these f luorochromes bind tightly 
and the fluorescence yield increases dramatically upon 
binding. It should be noted that a potential problem using 
this method is that free fluorochrome can act as a donor, 

25 though probably not an effective one. If the presence of 
free chromophore does in fact become a problem, the filament 
being imaged can be split into two parts arid fluorescent 
beads can be attached in a head-to-head configuration to 
serve as an acceptor-donor pair for energy transfer imaging. 

30 Another concern when using this method is that latex 

beads are prone to aggregation. This problem can be solved 
using an appropriate selection of chromophores as provided 
by, for example. Molecular Probes, Inc., Portland, Ore. 
Additional measures to be used against aggregation include 

35 maintaining some charge on beads through careful attention to 
ionic strength, and use of Triton X-100 detergent or BSA. 
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In the following step of the method the molten RecA- 
bead-DNA mixture is stained with 6-diamino-2 phenylindole 
dihydrochloride (DAPI) and spread on a microscope slide for 
optical mapping. Next, length and intensity measurements are 
5 used to map the .bead positions, as discussed in the following 
sections. In this step, "Red" beads, such as those supplied 
by Molecular Probes, Inc., can be used to provide contrast to 
DAPI's blue fluorescence. 

A large number of labeled RecA filaments can become 

10 another concern in optically based methods, because too many 
free fluorescent beaded filaments can obscure imaging beads 
present in the complex with target molecules. In accordance 
with a preferred embodiment of the present invention, the 
following steps can be taken to eliminate this problem, if it 

15 occurs: 

(A) Carefully titrate the labeled filament and balance 
the minimum necessary hybridization efficiency for convenient 
observations against contrast quality. RecA-mediated 
hybridization does not require the RARE methylation and 

20 restriction enzyme cleavage steps, so that hybridization 
efficiencies do not have to be critically optimized for 
acceptable results. 

(B) Unbound filaments can be diffused out through 
dialysis, or mild electrophoresis in gel fixed systems could 

25 selectively sweep filaments from the viewing field and leave 
the much larger target -filament complexes in place. 
Additional RecA protein can be added for stabilization, if 
necessary. 

The discussion above is not meant to provide an 
30 exhaustive list of molecular imaging techniques. Other 
techniques can also be used, as known in the art, if 
necessary. 

5.2.2. SIZING TECHNIQUES 
35 Methodologies for quantitative measurements of physical 

parameters associated with single nucleic acid molecules are 
of critical importance in virtually every aspect of physical 
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genomic analysis. Especially valuable are techniques for 
sizing single DNA molecules or fragments obtained from 
restriction digestions that can be used for construction of 
high resolution restriction maps. Although pulsed 
5 electrophoresis has been shovm to adequately separate large 
DNA molecules, accurate sizing remains problematic in a 
variety of other settings, and independent size measurements 
using parallel methods are often lacking. 

In accordance with one aspect of the present invention, 

10 several different methods are proposed for measuring the size 
of nucleic acid molecules. These methods can be broadly 
classified into two groups: (a) techniques in which the 
measured molecule remains static during the measurement 
period; and (b) techniques in which the size of the molecule 

15 is determined using dynamic measurements that require 
molecular perturbation. 

Static sizing techniques in accordance with the present 
invention generally include measurements of the relative 
fluorescence intensity of imaged molecules and measurements 

20 of their apparent length. Static sizing is convenient to use 
because it does not require very sophisticated equipment and 
is well suited for high -throughput parallel measurements. 
(See Section 5.4). On the other hand, dynamic, or 
perturbation-based sizing techniques, while at present being 

25 less suited for high- throughput measurements, sometimes 
provide superior results in terms of information content, 
precision and resolution. 

5.2.2,1. STATIC MEASUREMENT TECHNIQUES 
30 Static molecular sizing techniques are based on fixing 

the molecule to be measured on a plane surface, staining it 
with fluorescent dye, obtaining an image of the molecule and 
measuring parameters of the imaged molecule which have known 
correlation to the parameters of interest. In accordance 
35 with the present invention when used in a static measurement, 
molecules to be sized are first elongated and fixed on a 
plane surface using any of the methods described in Section 
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5.1 above. Restriction enzymes can also be added, if 
required, to enable the digestion of the fixed molecule. In 
such case, magnesium ions are diffused in, triggering 
digestion, after which restriction sites can be visualized as 
5 growing gaps in the elongated DNA molecules. This imaging 
approach is simple, effective, and has excellent sensitivity, 
since molecules can be visualized directly. In accordance 
with a preferred embodiment of the present invention, the 
molecules are elongated and fixed using the spotting approach 

10 described in more detail in Section 5.4, in which small 

droplets of solution are deposited in a regular grid manner 
onto a plane derivative surface and let dry. As shown, for 
example, in Fig. 26 A, B and C, after a spot dries, molecules 
remain elongated and fixed onto the surface in a "sunburst" 

15 pattern. 

5.2.2.1.1. Fluorescence Intensity Measurements 
In accordance with a specific embodiment of the present 
invention molecular sizing can be performed using 

20 measurements of the intensity of f luorescently stained 
molecules. This measurement approach is based on the 
observation that the size of a molecule is proportional to 
the amount of fluorescent dye it can absorb, which amount can 
be estimated by imaging the molecule. In a specific 

25 embodiment, the amount of fluorescent dye, and thus the size 
of the molecule, is determined using a measurement of the 
absolute fluorescent intensity. In this approach, however, 
the illumination source has to provide very stable and 
reproducible light output for the measurements to be 

30 accurate. Due to the fact that in practice absolute 

intensity measurements require precise calibration of the 
imaging equipment, and often are inaccurate, the size of the 
molecule is preferably determined by measuring its relative 
intensity compared to the intensity of a standard, i.e. . a 

35 molecule of known size within the image field. In practice, 
a standard frequently consists simply of a portion of the 
imaged molecule. 
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In accordance with one aspect of the present invention, 
the accuracy of f luorecscence intensity sizing can further be 
increased by providing a series of standards of different 
sizes and comparing the measured molecule to each individual 
5 standard. The size of the molecule being measured can thus be 
determined by combining all congruent size measurements, 
i.e. . homologous restriction fragments within different 
molecules, and averaging the results. As known in the art, 
this operation reduces the standard deviation of the sizing 

10 error in proportion to the square root of the number of 

measurements taken. In order to generate the desired series 
of standards, in one important aspect of the present 
invention restriction enzymes can be used to cleave a known 
molecule into a sequence of fragments with physical 

15 dimensions which can be known to within a single base pair. 

In accordance with a preferred embodiment of the present 
invention, a relative intensity sizing measurement involves 
obtaining of a digital image of the molecule being sized and 
of a standard, as defined above. High resolution, i.e. , IK x 

20 IK images with 16 bit gray level resolution are used in a 

preferred embodiment. If necessary, flat field correction of 
the digital image can be used to equalize the illumination 
intensity level over the image field. The method further 
involves the following steps: applying median filtering or an 

25 equivalent filtering operation to remove spot noise, if 

necessary; thresholding the resulting image to obtain binary 
images corresponding to the contours of the imaged molecules; 
applying background correction to remove the pixel intensity 
which corresponds to the background level of illumination for 

30 the image field; and measuring the relative intensity of the 
molecules to be sized with respect to the intensity of the 
known standard. More specifically, the intensity of the 
molecule to be sized is measured by adding the intensities of 
all pixels within the molecular contours obtained in the 

35 binarization step. Comparing the intensity measurement of 
the molecule being sized to the intensity of the standard (s) 
determines the relative molecular size. Thus, if the 
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underlying size of the standard is known, the absolute size 
of the molecule can be determined directly. The functional 
relationship between the relative image intensities and the 
molecular mass is approximately linear ( i.e. . the relative 
5 intensity is proportional to M^) . 

A drawback associated with the relative intensity sizing 
approach of the present invention is that, depending on the 
fixation, the measurement errors often tend to be absolute 
instead of being relative. This means that, for example, a 

10 20 kb standard deviation applies to a 60 kb fragment as well 
as to a 900 kb sized one. In other words, the coefficient of 
variation (the ratio between the mean size and the estimated 
standard deviation) can vary enormously and, as a result, the 
accuracy of measuring small fragments is reduced 

15 disproportionately compared to measuring larger fragments. 
The use of improved f luorochromes and better camera 
equipment, as described in Section 5.4 next, can help 
alleviate this problem. 

Experimentally, the lower size limit of the relative 

20 intensity optical mapping is about 300 bp, which limit can 
further be extended to smaller fragments by using sample 
averaging over a series of identical measurements. As 
discussed below, if the initial images are of good quality 
and are relatively noise- free, the accuracy of the method 

25 using fluorescence microscopy of DNA fragments can be 
increased to a single bp. 

An important advantage of relative fluorescence 
intensity measurements over the contour length approach 
discussed next is that molecules do not have to be perfectly 

30 stretched, because the method depends on the relative 

fluorescence intensity, which in turn is determined by the 
amount of absorbed dye and thus does not change much even if 
the fixation is not perfect. On the other hand, the accuracy 
of contour measurements depends on optimal fixation which is 

35 undesirable in some instances. 

Finally, in accordance with the present invention, 
significant improvement of the relative fluorescence 
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intensity measurements can be achieved by using sequence- 
specific f luorochromes, such as DAPI (which prefer AT 
sequences) or ethidium bromide (which favor GC regions) , to 
differentiate between similarly sized fragments of a 
5 molecule. In particular, a non-specific fluorochrome 
measurement can be made first, as described above. Next, 
DAPI can be used to discriminate between different fragments 
allowing size differences to be quantified to within a single 
bp. 

10 

5.2.2.1.2. Contour Length Measurements 
According to a second embodiment of the present 
invention, another way of measuring the size of static 
nucleic acid molecules is to image the molecules and estimate 

15 their contour length by measuring the length of the 

corresponding digitized images. As discussed below, the 
length of an imaged molecule provides an adequate estimate of 
the size of the molecule. 

As known in the art, objects in a digitized image can in 

20 many instances be characterized satisfactorily by structures 
composed of line and arc patterns. In accordance with the 
present invention, morphological image processing can be 
applied to obtain a quantifiable topological representation 
of the molecules being sized. Morphological processing in 

25 the context of this invention refers to operations where the 
imaged molecule is represented as a set of structural 
elements, such as lines and arcs, and thereby can be 
represented by a simplified but more revealing shape. 

In a specific embodiment of the present invention, the 

30 parameter of interest is the length of the imaged molecule 
which may not be entirely stretched. In order to estimate 
the length of the imaged molecule, following the image 
correction and binarization steps discussed above, in 
accordance with a preferred embodiment of the present 

35 invention algorithms known as "thinning" can be used to 

reduce the imaged molecule into a set of simple digital lines 
and arcs, which lie roughly along the medial axes of the 
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molecule. (The medial axis of an object is defined as the 
set of points which are equidistant from the nearest boundary 
point of the object) . More specifically, the digital image 
of a molecule being sized can be thinned using an image 
5 processing operation knovm as erosion, which consists of 

deleting border pixels that have more than one neighbor pixel 
which belongs to the object. (Jain, Fundamentals of Digital 
Image Processing, Prentice Hall, 1989) . Once the medial axis 
of an object, such as the imaged molecule, is determined, its 

10 apparent length can easily be computed by simply counting the 
number of connected pixels which belong to the axis. It can 
be appreciated that the contour length measurement method 
approach resembles the measurement of the length of a rope 
and because it is simple to implemient can also easily be 

15 automated. 

In accordance with the present invention the apparent 
length of the imaged molecule is next used to derive 
additional molecular parameters of interest by comparing it, 
for example, to the length of a known standard. In an 
20 alternative embodiment of the present invention, if the 
magnification of the system is known, the length of the 
digital image of the molecule can be converted directly to kb 
measurements . 

Contour length measurements have been found in some 
25 cases to be more accurate than the relative intensity 

measurements described in section 5.2.2.1.1 above, especially 
for small size molecules. The reason is that, as shown in 
Example 4 below and in Figs. 4 and 5, fluorescence microscopy 
can image single polymer molecules stained with an 
30 appropriate chromophore and provide a distinguishable outline 
of the molecule being imaged. Thus, even though the 
molecular diameter dimension may only be about 20 angstroms, 
single molecules can still be visualized easily on the basis 
of their apparent contour. On the other hand, the intensity 
35 of the fluorescent light from the molecule may not be 

sufficiently distinguishable from the background intensity, 
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in which case the relative intensity measurement method will 
give less accurate results. 

As in the case of relative intensity size measurements, 
size measurements using the contour length approach vary 
5 approximately linearly ( i.e. . proportional to MM and are less 
sensitive compared to the dynamic measurement methods 
discussed below, which can have M^-^"^-^. However, molecular 
measurements using static approaches are particularly 
suitable for high throughput systems and can be used for fast 

10 sizing and ordering of DNA fragments, such as restriction 
digests, as described in detail in Example 10, since the 
measurement time is essentially limited to the length of the 
imaging process. In addition, static measurements are simple 
to implement because they do not require complex molecular 

15 perturbation techniques, and require no specific flow and/or 
electrical field arrangements. 

In a specific implementation of both static sizing 
methods discussed above, image measurements can be performed 
using digital images having 16 bit gray level resolution. In 

20 particular, the original raw digital image is displayed in an 
enlarged format using, for example, pixel replication, and an 
overlay image is prepared by manually tracking the DNA 
contour. The contour length map in accordance with the 
embodiment described in Section 5.2.2.1.2. above can be 

25 prepared from this overlay directly. In a preferred 

embodiment of the fluorescence intensity measurement approach 
(Section 5.2.2.1.1.), the 13 -bit raw image data is smoothed 
and the overlay image is dilated five times to cover all 
foreground pixels. For each pixel marked on the overlay as 

30 being part of the molecule, a synthetic background level is 
calculated as the weighted average of the surrounding pixels, 
with weight factors decreasing with distance, and equal to 
zero for the marked pixels. For example, a 3x3 or a 5x5 
window can be used for this purpose, with coefficients 

35 determined to add up to unity, as known in the art. 

Next, the intensity of a particular molecule or DNA 
fragment can be determined by subtracting the sum of the 
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matching background pixel intensities from the sum of all 
pixel intensities which belong to the fragment. This 
measurement can be repeated for each frame of raw image data 
that had an overlay image, excluding those frames with poorly 
5 focused images.^ To increase the accuracy of the experiment, 
intensity measurements are averaged over several images 
(e.g., over 5 images). The same measurement approach can 
also be used to measure the relative sizes of two different 
fragments. In this case, if the length (or the relative 
10 intensity) of one fragment is labeled x, and the same 

measurement for the other fragment is y, the relative sizes 
of the two fragments can be calculated simply as: 
SIZEi = x/(x+y); SIZE2 = y/(x+y); 

Analogously, if one of the fragments ( e.g., y) is later 
15 cut into sub- fragments u and v, the size of fragment u, for 
example, is computed as 

SIZE, = [u/(u+v)] [y/(x+y)] ; 

For a series of cuts, the relative size of each segment 
is computed in accordance with the present invention 
20 analogously as the ratio of the segment measurement (x) over 
the sum of all fragment measurements. 

5.2.2.2, DYNAMIC MEASUREMENT TECHNIQUES 

Measuring fragment sizes using dynamic relaxation has 

25 important advantages over the static methods discussed in 
Section 5.2,2.1. above. The reason is that in static sizing 
it is sometimes critical that the molecules are optimally 
stretched. Overstretched or suboptimally elongated molecules 
cannot be measured accurately using absolute- length based 

30 static measurements because the functional relationship to 
the molecular mass in this case is dependent on the level of 
elongation. (However, relative size measurements, as 
described in the preceding sections, are immune to the level 
of molecular stretching) . In addition, a specific problem 

35 encountered using stationary sizing methods is that if the 
fixation is imperfect inadequately fixed fragments are prone 
to premature relaxation. This in turn can adversely affect 
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the accuracy of the sizing measurement. On the other hand, 
strong fixation of the DNA to the surface typically 
interferes with the observation of cut sites, which generally 
requires local relaxation to produce visible gaps. (The 
5 facultative fixation technique described herein, however, 
adequately addresses this problem) . 

In contrast, dynamic measurements of DNA molecules do 
not always require molecules to be completely stretched out 
in order to obtain accurate measurements. Thus, for example, 

10 molecular relaxation time measurements are typically 

independent of the degree of coil extension. This important 
feature of the dynamic measurements has been demonstrated, 
for example, in measuring DNA relaxation times using the 
Massa visco-elastic technique. 

15 Additionally, parallel dynamic measurements can be made 

using molecular imaging techniques (e.g., fluorescence 
microscopy) , and size distributions can be determined 
accurately since the conformational dynamics of each molecule 
can be measured separately. Finally, a compelling reason for 

20 using dynamic relaxation methods is that the associated 
relaxation times (t) are strongly size dependent, with t 
being proportional to the molecular weight according to 
M^*^'^'^, so that size discrimination is much improved and also 
ultimately more accurate compared to the static methods 

25 considered above. Naturally, the actual molecular size 
dependence will vary with the chosen relaxation mode. The 
following sections discuss various dynamic molecular sizing 
methods in accordance with different embodiments of the 
present invention . 

30 

5.2.2.2.1. Optical Contour Maximization (OCM) 
The OCM molecule sizing method of the present invention 
is based on the observation that when a linear DNA molecule 
snags an obstacle during electrophoresis in a loose gel 
35 matrix it elongates nearly completely to form a metastable 
hook that can persist for several seconds. Such loose matrix 
can be formed, for example, at the coverslip-agarose gel 
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interface, as described in Section 5.1.1. The gel-coverslip 
interface in this case consists of a loose matrix, a few 
microns deep, which is ideal for OCM sizing use because it 
provides a convenient series of "pegs" for DNA molecules to 
5 ensnare and form hooks upon ( See Fig. 9) . A relatively weak 
electrical field (e.g., 5-30, volts/cm) is sufficient for 
complete elongation of a tethered or temporarily snared DNA 
molecule. If the hook arms are similarly sized, the 
molecule can be stretched out to nearly its full contour 

10 length which can then be measured easily using standard 
imaging techniques. In accordance with a preferred 
embodiment of the present invention using OCM sizing, the 
longest observed hook contour length is determined from a set 
of rapidly collected images and digitally processing the 

15 resulting images to determine the length of the molecule. 

Unlike the static contour length measurements approaches 
discussed above, the degree of molecular elongation using OCM 
is optimal. In fact, the maximal contour lengths determined 
in this method show linear correlation to the reported size 

20 in the 240-680 kb interval. OCM sizing accuracy and 

precision is very high, as good as or even better than pulsed 
electrophoresis based measurements. A disadvantage of this 
approach is, however, that in order to complete the 
measurements, a series of consecutive images must be taken in 

25 order to capture the optimum molecule elongation before it 
leaves the visual field due to the applied electrical field. 

5.2.2,2,2. Viscoelastic Sizing Methods 

Viscoelastic measurement techniques used in accordance 

30 with a preferred embodiment of the present invention are 
based on perturbing the coil conformation and measuring the 
time it takes the molecules to return to random states. The 
measured relaxation times are quite sensitive to molecular 
weight and vary approximately as M^-^^ In this measurement 

35 approach, within a given size distribution the largest 
molecules dominate the measured relaxation, so that size 
mixtures typically cannot be fully analyzed. 
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In a specific embodiment of the present invention, coil 
relaxation is measured in gels and in free solution for size 
determination of heterogeneous samples. In particular, 
fluorescence microscopy can be used to monitor coil 
5 conformational relaxation kinetics to rapidly size large 
single molecules (in gels). In this respect, it has been 
shown that coil conformational dynamics can be measured in a 
solution, yielding reliable average molecular dimensions that 
can be related easily to size. 
10 In a different embodiment, coil relaxation can be 

measured in agarose using morphological image analysis. As 
known, the coil relaxation size dependency in gels is 
superior to that in a solution, in particular it is 
proportional to M^"^ (as predicted by reptation theory) . DNA 
15 molecules from mammalian chromosomes, however, can be 
difficult to measure because their relaxation times are 
extraordinarily long, even in a solution. For example, if a 
100 Mb sized molecule has a measured relaxation time of over 
7 hours, a whole day will be needed to collect all the 
20 necessary data. it is estimated that the relaxation times 
are increased 10 to 50 fold in gels as compared to solution, 
in which case the experiment can last several weeks or even 
months. In order to increase the accuracy of the 
measurements and speed up the experiments, typically one can 
25 perform parallel experiments which can be averaged out. 

In accordance with another specific embodiment of the 
invention, the time to return to a random conformation can be 
shortened using a "twitch" technique to distort the molecule 
only slightly. The measured relaxation time using twitching 
30 has been shown to be the same as if the coil was fully 
distorted. Essentially, in this preferred embodiment the 
total relaxation time is equal to the perturbation time and 
thus takes much less time to measure. 

In another specific embodiment of this invention, free 
35 solution measurements can be made using relatively mild 
electrical field strengths (40 volts/cm) to perturb 
conformation. in this embodiment, molecules are suspended in 
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solution, mounted on the microscope, electrically perturbed, 
and the resulting relaxations are monitored by fluorescence 
microscopy and digitally recorded by an image processor. 
Morphological analysis of these images can be used to track 
5 the relaxation process by automatically characterizing 
molecular shapes, j..?. , by fitting ellipsoids around the 
image of the relaxing coil, as described next. 

Both experimental and theoretical studies of DNA 
conformation during gel electrophoresis show that a DNA 

10 molecule stretches out to form long hooks, which relax back 
to a compact conformation in a cyclically occurring fashion. 
Hook formation can be used to stretch DNA molecules out so 
that when the perturbing electrical field is shut off, 
relaxation kinetics of single molecules can be quantified by 

15 simply imaging them and measuring the length changes. In 
principle, this measurement is similar to stretching out a 
spring, releasing it and measuring its recoil kinetics by 
watching it shrink back to a relaxed state. 

To perform this measurement, large DNA molecules, 

20 stained with ethidium bromide, are embedded in 1% agarose and 
mounted on a epif luorescence microscope, equipped with a SIT 
camera (a low light level sensitive device) and interfaced to 
an imaging board set of a computer. Electrodes in the 
microscope chamber are pulsed so that molecules form hooks, 

25 and their lengths are measured automatically during 
relaxation by a special program written in NIH image 
macroprogramming language available from Wayne Rasband at 
wayne@helix.nih.gov. The relaxation of the DNA molecules 
starts when the applied field is shut off. In a specific 

30 example of yeast chromosomal DNAs, single exponential 

relaxation times are calculated for a series of molecules and 
are graphed as a In-ln plot versus size. The slope of this 
line gives the molecular weight dependency for t, the 
relaxation time (T)=constant (size)^ ''^ (kb) . 

35 In accordance with a specific embodiment of the present 

invention, fast coil relaxation times that correspond to 
Zimm-Rouse relations normally encountered in solution can be 
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initially measured. In a gel matrix, a stretched out DNA 
molecule with length L(t) {this is actually the length of the 
primitive tube, is known to relax as <L (t ) >=Aexp ( -t/r) +<Le> 
where t is the relaxation time, t is time and the brackets 
5 represent an ensemble average. L{t) in the formula above can 
be interpreted as the apparent molecular length as imaged by 
the microscope. Le is the equilibrium molecular tube length 
and is measured as a plateau region in an exponential decay. 
L forms the basis of the baseline sizing methodology, as 

10 discussed below. 

Image collection procedures for the visco-elastic sizing 
methods are virtually identical to those described in the 
previous sections so that the same images can be used for 
both length and relaxation measurements. In this approach, 

15 the morphological analysis uses image processing routines to 
fit ellipsoids around the image of the relaxing coil mass. 
In accordance with the method, the associated major and minor 
axes of the fitting ellipsoids are used to estimate the 
relaxation progress. A set of molecules can be used to 

20 benchmark and establish, relaxation dependent sizing 

conditions. Statistical analysis can be used to determine 
the precision and accuracy of these measurements. The 
functional dependence of the molecular size to the relaxation 
time is approximately M^-^. 

25 

5,2.2.2.3. Baseline Measurement Sizing 
In accordance with another embodiment of the present 
invention, single molecule sizing can be performed using what 
is known as "baseline" measurements. Specifically, typical 

3 0 DNA relaxation plots showing apparent length versus time 
dependencies provide plotted points that are averages of 
several (usually 4-5) relaxation measurements. Such plots 
show that the measured length of the molecule decreases in an 
exponential fashion and, importantly, that the molecule does 

35 not fully relax to a spherical random conformation. Instead, 
a quasi-equilibrium structure is formed which resembles a 
thickened, short rod-like object. The formation of such 
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object signals an end of the exponential decay. Very slow 
relaxation processes are still happening, but they are of a 
different nature and develop on a different time scale, which 
could be proportional to the cube of the molecular size, 
5 i.e. , to 

Within the time scale actually used (e.g., hundreds of 
seconds) , length measurements settle down to an equilibrium 
value which is termed the "baseline". Baseline values vary 
linearly with DNA size and are very reproducible. In a 

10 preferred embodiment of the present invention, relaxation 
measurements yield molecular size estimates in two 
independent ways: (1) by determination of the relaxation 
time, T, and (2) by length measurements for baseline 
determination. Thus, the two measurement approaches could be 

15 used simultaneously to derive different, independent 
estimates of the molecular size. 

More specifically, the procedures for carrying out the 
relaxation measurement in accordance with the baseline 
measurement method of the present invention are as follows: 

20 (1) Apply an electrical field and keep the selected 

molecule in the visual filed by switching field orientation. 
When a hook is formed, turn off the electrical field 
immediately before one hook arm is pulled off from the apex, 
and then start collecting images. Proper imaging requires 

25 that the entire molecule be in focus. 

(2) Collect images every 10 or 20 seconds using 8 or 16 
video frame -averaging to reduce noise. Up to about 50 images 
for each measurement are necessary. 

(3) Repeat steps (1) and (2) for a given molecule as 
30 many times as possible for subsequent data averaging. 

(4) Analyze and process each of the resulting images. 
(Processing steps can include noise reduction, smoothing and 
skeletonization to produce suitable images for binarization, 
so that an automatic analysis algorithm can operate on the 

3 5 images) . Next, extract length parameters and obtain the' 

molecular relaxation plots h^it) , where i is the image number. 
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(5) Add all relaxation plots of a given molecule 
together to perform ensemble average, i.e. . determine <L(t)> 
over all images. Determine the baseline <L> from the end 
plateau of the relaxation curve and fit the curve using the 
5 expression 

<L{t)>=A exp(-t/T) ^ <L> 
I to obtain an estimate of the relaxation time t. 

The method steps above can be implemented in a specific 
embodiment of the present invention using a Zeiss Axioplan 
^ 10 epifluorescence microscope with #15 filter cube (green 

excitation, red observation), and Pol Plan-Neof luar lOOxz and 
> 63x 1,30 numerical aperture objectives (for larger 

molecules) . The distance per pixel can be calibrated using a 
USAF-1951 resolution target, and was determined in a specific 
15 embodiment to be 0,217 fim and 0.345 /im respectively. A 6115A 
precision power supply (Hewlett-Packard) can be used to 
provide potential across the chamber electrodes. Frames from 
a C24 00-SIT camera (Hamamatsu) can be averaged by 
PixelPipeline (Perceptics) , digitized (480x512x8 bits) and 
20 stored in a Macintosh Ilfx computer. Averaged images are 
preferably processed to remove background, reduce noise, and 
simulate shadowing (some images) using NIH Image program, 
available from wayne®hel ix.nih.gov and NCSA Image software 
for Macintosh (available from softdev@ncsa.uiuc.edu), and 
25 photographed by a Polaroid film recorder. 

Sample preparation using the baseline measurement method 
of the present invention generally is prone to variations 
that can affect the results. For example, small gel samples 
*? are melted and reformed within a thin region between a slide 

30 and a coverslip. Evaporation can also be a problem. Despite 
I these concerns, measurements made using this approach were 

found to be reproducible, particularly if fluid adhering to 
the gel slices containing DNA is removed prior to melting. 
Uniform gelation conditions must also be stringently 
35 followed. 

In a specific application of the baseline sizing method 
>i in accordance with a preferred embodiment of the present 
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invention, yeast chromosomal DNA can be resolved by Pulsed 
Oriented Electrophoresis in 1% Seakem low melting agarose 
(FMC) , l/2x TBE (42.5 mM Tris, 44 . 5 mM boric acid, 1.25 mM 
disodium EDTA) . To this end, excised gel bands, or 
5 alternatively -a synthetic matrix, are repeatedly equilibrated 
in TE (10 mM tris, 1 mM EDTA, pH 8.0) (19) . Bands are further 
equilibrated in TE containing 10 mM NaCl, melted 72°C, 10-15 
min and equilibrated to 21^C. Ethidium bromide (final 
concentration 1 /zg/mL) and 2-mercaptoethanol (final 

10 concentration 10 fiL/mL) for minimizing photodamage are added 
to melted sample, equilibrated at 37^0 from 10 min to a few 
hours. To prepare the final sample, 10 (cutoff yellow 
pipette tip used) is cast onto a preheated slide with 1.8 cm 
X 1.8 cm coverslip and applied to a stage electrophoresis 

15 chamber, with 2 cm electrode spacing. The edges of the 

coverslip are sealed with mineral oil to prevent evaporation. 
Coverslips and slides are cleaned by boiling in 0.075M HCl 
for about one hour, rinsed with distilled water several 
times, and stored in 100% ethanol before use. Mounted 

20 samples are incubated at 4**C for at least 15 min before image 
collection at 37*C. 

Following the method steps above, relaxation time 
determinations can be made more accurate by averaging about 3 
to 8 measurements. For each curve, the length Li is 

25 determined from the last 15 data points, which are then used 
along with the first 20 or 30 data points to extract the 
relaxation time t. The distribution of the measured Li is 
relatively narrow and the standard deviation is less than 
<L>-l/2 . 

30 Figure 27 A and B show relaxation measurements as a 

function of molecular size (245-980 kb) and the parameters 
extracted from each. All curves can be seen to fit 
reasonably well a single exponential decay. Disengagement 
from the tube is not significantly observed from the figures. 

35 Figure 28 A and B are plots of relaxation vs. size 

respectively. By fitting the experimental measurements to 
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the adopted mathematical models, the following two 
relationships were obtained: 

<L> (pixel)=0.345 SIZE (kb) - 32, (1 pixel = 0.27/im) 
T (second) =0.017 SIZE^ '*Mkb) . 
5 Notably, the relationship for <L> has a negative 

intercept after fitting data for a wide range of molecular 
sizes. For small molecules the relationships are <L> = SIZE'' 
with v=l/2. Values for v depend on molecular size and range 
from 1/2 to nearly 1 for large molecules. 

10 

5.2.2.2.4. Other Molecular Sizing Methods 
Additional aspects of the present invention involve 
molecular sizing using different measurement techniques. In 
a specific embodiment, molecular sizing is performed by 

15 measuring the reorientation time of a molecule subject to at 
least one external force using, for example, sequential 
electric fields applied in different directions. This 
approach is described in Example 6 below and is illustrated 
in Figs. 4A and 5J. Using the process as described below in 

20 the Examples, it has been determined that during pulsed field 
electrophoresis, the blob train of a DNA molecule orients 
with the applied electric field in a very complicated manner 
and during this process, electrophoretic mobility is retarded 
until alignment is complete, e.g., until the molecule is 

25 aligned with the applied field. Upon field direction change, 
the blob train moves in several new directions simultaneously 
^ X'^' , the blobs appear to be moving somewhat indepen- 
dently) . Eventually, some part of the blob train dominates 
in reorienting with the applied field and pulls the rest of 

30 the blobs along its created path through the gel. The time 
necessary for complete blob train alignment varies directly 
with size; i.e.,, a 10 mb (lmb= 1,000 kb) molecule requires 
one hour to reorient, while a 10 kb molecule requires only 
ten seconds, using similar field strengths. The phenomenon 

35 is illustrated in Fig. 4. Reorientation is measured in 

various ways, including by light microscopy and by microscopy 
combined with spectroscopic methods. 
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Another embodiment of this invention involves 
measurement of the rotation time of a molecule subject to 
sequential electric fields in different directions. Rotation 
of a molecules using this approach requires a series of 
5 incremental reorientation steps, each of which causes the 
molecule to rotate further in the same direction, until the 
molecule has undergone a rotation of a specified angular 
increment, for example, 360®. This embodiment is 
particularly well suited to characterize stiff, rod-like 

10 molecules, such as small DNA molecules, which do not 

significantly change conformation upon application of an 
external force. Notably, however, large molecules can also 
be sized using this method if the conformation of the 
molecules is kept relatively constant, preferably in a rod- 

15 like or elongated conformation. In accordance with a 
specific embodiment of the present invention keeping the 
conformation of large molecules is accomplished by applying a 
pulsing routine which is appropriate to the size, shape and 
perhaps also the composition of the molecule. 

20 As a non-limiting example, molecules are rotated in the 

presence of sinusoidally varying electrical fields applied at 
90® to each other. Stiff, rod-shaped molecules or stretched 
molecules are rotated about, their long or short axis. 
Rotation about the long axis has the greatest molecular 

25 weight dependence, with rotation diffusion being proportional 
to about M^. Rotational motion of a rod-shaped molecule 
immersed in a gel or any other confining can be difficult if 
an attempt is made to simply rotate the molecule as a boat 
propeller rotates in water. When a gel is used, the matrix 

30 affects the rotation of the molecule much as seaweed affects 
the rotation of a boat propeller. Thus, a pulsing routine is 
applied which also provides back and forth motion of the 
molecule, thereby facilitating rotation. 

Generally speaking, an algorithm defining the pulsing 

35 routine can depend on variables such as the angle increment, 
time, electric field intensity, etc., and these may in turn 
be functions of different variables. Thus, numerous types of 
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algorithms can be used in accordance with this embodiment of 
the present invention. 

In a preferred embodiment, the pulsing routine used in 
the present invention is defined as follows 
5 Ei(t) = E (t , ej (icosei + JsinGi) (At) 

E2(t) = E {t^ej (icos(0i+7r) +lsin(0i + 7r) ) (At) 
Pi = Ki * Ei(t), K2 * Ejit), K, * El, (t) 
wherein 

Ei(t) and £3,^) are electric field vectors multiplied by 
10 time (volt . sec/cm) ; 

E(t,e^) is the electric field intensity in volt/cm; 
i and j are unit vectors; 

e, is the field angle, in radians or degrees, with i = 1- 
n, where n/E*ei/i = l = 2n or 360** for a complete rotation; 
15 At is pulse length, in seconds; 

t is time in seconds; 

ki and are the number of successive identical pulses; 

and 

P is a pulsing routine, which may be repeated. 

20 Using the above routine, a molecule to which appropriate 

pulses are applied rotates about (e,.i - 6 J radians or degrees 
when each set of pulses P are initiated. Also, the molecule 
is translated, moving laterally in the directions of E(t) and 
-E(t), thereby facilitating rotation. 

25 In the above equation, At is a constant, however, this 

need not always be the case. E may be a function of one or 
more variables. For example, E may be a function of total 
elapsed time and/or angle increment. Also, the sum of all 
the angular increments need not be 360°, and may be any 

30 number of partial or total rotations which provide 

measurements of sufficient accuracy. A specific set of 
conditions for measuring the rotation rate of molecules are 
set forth in Example 7 . 

In another embodiment of the present invention, sizing 

35 involves measuring the diameter of a relaxed molecule. 

Measurements of the molecular diameter are made according to 
the same procedure of staining molecules, placing the 
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molecules in a medium, etc., as in the case of the 
curvilinear length measurements described above. However, in 
this embodiment of the invention it is not necessary to 
perturb the molecules before the measurement. Instead, the 
5 molecules are measured when they are in a relaxed state and 
generally have a spherical or elongated elliptical shape. 
Because the volume of a sphere is proportional to where R 
is the radius of the sphere, and the volume of an ellipsoid 
is proportional to ab^ where a is the radius of the major 

10 axis, and b is the radius of the shorter axis, resolution for 
this technique varies functionally approximately as M-*^ 
Molecules measured by this technique need not be deformable. 
The technique can be used for all sizes of DNA molecules and 
in particular is useful for sizing both large DNA molecules 

15 on a microscope slide, and densely packed molecules. 

In accordance with the present invention, molecules can 
also be sized by measuring rotational diffusion in free 
solutions. The rod rotational diffusion coefficient is 
remarkably sensitive to size -- approximately length^ The 

20 equations describing rotational frictional coefficients are 
as follows: 

frot = 87n7LV3[(J-Y,^,)] , 

where rj is viscosity, Y^^ = 1-57 - 7(l/J - 0.28)^ and J = 
25 ln(2L/b); L and b are half of the rod long and short axes 
respectively. A useful expression for the molecular rotary 
relaxation time is given by: 
T, = f,ot/kT = 47rT?LV9[J - Y,^,)] , 

3 0 In accordance with a specific embodiment of the present 

invention rod rotational diffusion coefficients are 
determined using fluorescence dichroism, as measured by 
microscopy, of small (100-3,000 bp) single, ethidium bromide 
stained DNA molecules . Fluorescence dichroism tracks 

35 orientation as a function of time, providing the necessary 
kinetics information for coefficient determinations. 
Orientation analysis utilizes the equations above. An 
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advantage of the single molecule approach over standard bulk 
measurements is that the data is intrinsically size 
deconvoluted. 

The experimental setup in accordance with a specific 
5 embodiment of the present invention consists of a Zeiss 
microscope fitted with an ethidium bromide filter pack, 
illuminated by an argon ion laser source, providing 488 nm 
polarized radiation, a Hinds photoelastic modulator and 
detection by a microchannel plate detector interfaced to a 
10 CCD video camera. Camera output is provided to the image 
processor for data storage and analysis. A Fluke, high 
power/speed amplifier provides -^/-ISOO volts at the required 
frequency for alignment. Since the thin sample films used 
for microscopy draw little power, temperature control in this 
15 embodiment is relatively simple. Molecular alignment can be 
done tried using both AC or DC electrical fields. AC fields 
have the advantage of zero net translation during an 
experiment. If increased field strength is required, the 
sample cell can be reduced in size, bringing the electrodes 
20 closer together. 

To measure the rotational diffusion coefficient, an 
electrical field is applied briefly to orient molecules and 
shut off, allowing Brownian motion to relax the molecules. 
Depolarization is next tracked by gathering the total 
25 fluorescence decay output of each molecule in the field by a 
microchannel plate/CCD video camera. As the molecule tumbles 
and falls out of plane with the exciting radiation, its 
fluorescence intensity changes in an exponential fashion with 
a characteristic time given by the rotational diffusion 
30 coefficient. Since video cameras operate at a frame rate of 
30 frames per second, fluorescence intensities are recorded 
every 1/30 second by the image processor. The whole process 
is repeated several times and the results are averaged. An 
advantage offered by a video camera detection system is that 
35 an ensemble of individual molecules can be measured 

distinctly and simultaneously, resulting in parallel data 
collection and processing. The calculated relaxation time 
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for a 3 00 bp DNA molecule is about 4 microseconds in water, 
too fast for our detection system. But since the rotational 
diffusion constant increases linearly with viscosity, 
substituting 98% glycerol can be used to boost the viscosity, 
5 and chilling the sample on the stage can further increase 
viscosity by a factor of 10^ If high glycerol concentration 
causes DNA denaturation at low temperature, sucrose can be 
tried. Either approach should provide a viscosity boost 
sufficient to bring the rate into range for video data 
10 collection. 

Although rotational coefficients are usually determined 
in solution, it is known that DNA molecules less than 300 bp 
can freely rotate within an agarose matrix since their 
measured rotational diffusion coefficients are similar to 
15 free solution values. Embedding small DNA molecules in 
agarose during measurements can be used to stem any 
convective forces, should they be found to severely perturb 
measurements . 

20 5.2.3. METHODS FOR INCREASING THE MEASUREMENT ACCURACY 

This section provides a brief outline of statistical 
techniques used to increase the accuracy of molecular size 
measurements in accordance with the present invention. The 
methods are based on obtaining a series of estimates of the 

25 desired parameters and manipulating the parameter estimates 
in accordance with known statistical error analysis criteria. 
Conceptually, each measurement of the desired size of the 
molecule using any one of the methods described above can be 
interpreted as an estimate of the true quantity, which is 

3 0 free of measurement errors. There is no guarantee, however, 
that a specific measurement will not be grossly incorrect, in 
which case the estimated parameter is useless for analysis. 
A well known method to reduce this probability is to take a 
series of measurements and use the mean value (the sum of all 

35 measurements divided by the number of measurements) . On the 
other hand, a measurement of the sample variance gives an 
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estimate of how accurate the measurements are, i.e.,, how 
close they are to a hypothetical ideal value. 

It is well known that for a set of independent, normally 
distributed measurements, the accuracy of the measurement 
5 increases with the squared root of the number of the 

measurements, i.e.,, as sqrt(n), where n is the number of the 
measurements. Thus, obtaining the average of 10 independent 
measurements increases the accuracy of the size estimate by a 
factor of about sqrtdO) = 3.16. Statistical confidence 
10 intervals which determine the probability that a specific 
measurement deviates from the mean value can be used to 
estimate the consistency of the measurements, as known in the 
art. Thus, for example, probability density functions (pdf) 
for sample variations which are widely spread indicate 

15 inaccurate measurements (which can be discarded) , while 

highly peaked pdfs indicate that the sample bin is consistent 
and likely to be accurate. 

In accordance with a preferred embodiment of the present 
invention, averaging a series of measurements to increase the 

20 accuracy of the molecular size estimates is used in all 
cases, whenever possible. In addition, to further 
characterize the sample population, after the measurements 
are averaged, the 90% confidence interval on the mean 
measurement value is calculated using the t distribution with 

25 n-1 d.f . and the sample standard deviation, (See, for 

example, Bendat et al . , Random Data: Analysis and Measurement 
Procedures, John Wiley, 1986). This calculation assumes that 
the measurement data represents random samples from a normal 
distribution and means that there is a 90% chance that the 

30 population mean falls within the confidence interval. The 
midpoint of this interval is used to estimate the standard 
deviation of the population. Next, the coefficient of 
variation (CV) for the measurements can be obtained as the 
estimated population standard deviation divided by the sample 

35 mean. The pooled standard deviation for the measurement is 
computed in accordance with the present invention using the 
functional expression sqrt(the average of the variances). 
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Finally, the relative measurement error of the measurements 
in accordance with the present invention is computed as the 
difference between the measurement value and the reported 
value divided by the reported value. These and other 
5 relevant statistical measurements which are computed in a 
typical measurement using the methods of the present 
invention are of critical importance in increasing the 
accuracy of the sizing methods used, and in comparing the 
results to that of other sizing techniques. 

10 In accordance with the present invention, elongated and 

possibly fixed single nucleic acid molecules can be imaged 
using a number of techniques to generate digital images of 
the molecules. These images can then be processed to obtain 
quantitative measurements of molecular parameters of 

15 interest. To this end, in a preferred embodiment of the 
present invention, the molecules being imaged are first 
stained with f luorochromes which are absorbed by the 
molecules generally in proportion to their size. The size of 
the stained molecules can later be determined from 

20 measurements of the fluorescent intensity of the molecule 
which is illuminated with an appropriate light source. 

5.3. GENOME ANALYSIS/MANIPULATION 

25 Described herein are methods whereby the single 

elongated molecules of the invention may be utilized for the 
rapid generation of high resolution genome analysis 
information. Such methods include, as described below, both 
optical mapping and optical sequencing techniques. 

30 

5.3.1, OPTICAL MAPPING 

The optical mapping techniques of the invention allow 
direct, ordered mapping of restriction sites for the rapid 
generation of high resolution restriction maps. Briefly, 
35 such mapping techniques involve the elongation and fixation 
of single nucleic acid molecules, digestion of the molecules 
with one or more restriction enzymes and the visualization 
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and measuring of the resulting restriction fragments. 
Because the single nucleic acid molecules which are being 
digested are fixed, the resulting restriction fragments 
remain in register, such that their order is immediately 
5 apparent and a rapid restriction map is instantly generated. 
The optical mapping techniques described herein have a 
variety of important applications, which include, for example 
the efficient generation of genomic physical maps, which, 
until the present invention, had proven to be time consuming, 

10 costly, difficult and error prone. in fact, the approaches 
described herein make possible the creation of ordered, 
complex high resolution restriction maps of, for example, 
eukaryotic, including human chromosomes without a need for 
analytical electrophoresis, cloned libraries, probes, or PGR 

15 primers. 

Further, such techniques have wide ranging diagnostic 
applications. For example, nucleic acid from individuals may 
be tested for polymorphisms which may be associated with 
certain disease alleles. For example, such polymorphisms may 
20 represent restriction fragment length polymorphisms, 

rearrangements, insertions, deletions and/or VNTR (variable 
number tails repeats) . 

Nucleic acid molecules of from about 500 bp to well over 
1000 kb can efficiently be mapped by utilizing the techniques 
25 described herein. The single nucleic acid molecule -based 
techniques can easily be utilized in high throughput 
applications such as are described, below, in Section 5.4. 

For optical mapping, single nucleic acid molecules are 
elongated and fixed according to the techniques described in 
30 Section 5.1, above. While either agarose or solid surface- 
based elongation/fixation methods may be utilized, solid 
surface techniques are, generally, preferred. As discussed 
in Section 5.1, the elongation/fixation techniques should be 
optimized to yield a balance between elongation capability, 
35 relaxation capability and retention of biological function! 
By appropriate elognation and fixation, the single nucleic 
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acid molecules relax somewhat, with the fragments, therefore, 
moving apart upon cutting. 

Cleavage sites are, therefore, visualized as growing 
gaps in imaged molecules. The molecules are restrained, 
5 however, from fully relaxing to a random coil conformation, 
which would make accurate fragment measurement impossible. In 
addition to gaps, cleavage is also signaled by the appearance 
of bright condensed pools or "balls" of DNA on the fragment 
ends at the cut site. These balls form shortly after 

10 cleavage and result from coil relaxation which is favored at 
ends (see Figs. 13 and 15). Cleavage is scored more reliably 
by both the appearance of growing gaps and enlarging bright 
pools of segments at the cut site. Otherwise, it is possible 
that what appears to be a gap may, in fact, be a single 

15 molecule, part of which is out of the plane of focus. 

Optical mapping restriction digests may be performed by 
utilizing standard reaction mixtures and conditions ( e.g. . 
incubation times and temperatures) . Because the technique 
relies on the fixed nature of the nucleic acid molecules 

20 being digested, however, it is critical that the 

elongation/fixation process be completed prior to the 
initiation of restriction digestion. There exist a number of 
methods by which the start of restriction digestion can be 
controlled, a number of which involve keeping the restriction 

25 enzyme apart from whatever cofactor ( e.g. . Mg^*) is necessary 
for that particular enzyme's activity until the initiatin of 
digestion is desired. 

For example, when using agarose-based 
elongation/fixation techniques, the nucleic acid may be mixed 

30 into molten (preferably low melting) agarose along with 
restriction enzyme and appropriate buffer, but without 
cofactor. When the reaction is to begin, the cofactor can be 
added, thus activating the restriction enzyme. 
Alternatively, the cofactor can be mixed into the agarose in 

35 the absence of restriction enzyme. In order to begin 

digestion, the enzyme can be added and allowed to diffuse 
into the gel . 



- 68 - 



wo 96/31522 



PCT/US96/(M550 



When solid surf ace -based elongation/fixation techniques 
are used, restriction digestion reaction mixture, in the 
absence of either restriction enzyme or cofactor, can be put 
into contact with the solid surface. At the appropriate 
5 time, the missing component ( i.e. , . either the restriction 
enzyme or the cof actor) can be added to the surface. 
Alternatively, a complete reaction mixture can be introduced 
onto the solid surface, with digestion beginning once the 
mixture comes into contact with the elongated/fixed nucleic 

10 acid molecules. Additionally, a necessary divalent cation 
can be introduced in a chelated fashion wherein the chelation 
is a photo-labile chelation, such as, for example, 
DM-nitrophen. When the digestion is to begin, the chelator 
is inactivated by light, releasing the divalent cation which 

15 begins the digestion. 

It should be noted that not each of the restriction 
sites present on a given nucleic acid molecule will be cut 
simultaneously, meaning that not all gaps will appear at the 
same time. This is expected, given the variable rate of 

20 enzymatic cleavage exhibited by restriction enzymes. Rather, 
cuts usually appear within a short time, for example, 5 
minutes, of each other. 

The molecules being restricted and analyzed via such 
techniques may be visualized via techniques including those 

25 described, above, in Section 5.2. 

The resulting fragments can be sized according to 
techniques such as those described in Section 5.2. Such 
techniques can include, for example, a measure of relative 
fluorescence intensities of the products and by measuring the 

30 fragments' relative apparent molecular lengths. Averaging a 
small number of molecules rather than utilizing only one 
improves accuracy and permits rejection of unwanted molecules 
or fragments. Maps are then constructed by simply recording 
the order of the sized fragments. 

35 The mapping techniques described thus far function in 

the efficient generation of single nucleic acid molecule 
restriction maps. A knowledge of the orientation of these 
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individual molecules, however, would be very useful for the 
alignment of greater than one such restriction map into a 
large, ordered map, A variety of techniques may be utilized 
to distinguish or dif erentially identify one end of a 
5 molecule, thereby marking its orientation or polarity. 

For example, mapping vectors may be produced and used in 
conjunction with the mapping techniques described herein. 
Such vectors can serve to introduce a "tag" to one end of a 
molecule being analyzed. Such a tag can comprise, for 
10 example, a rare restriction enzyme cutting site, a protein 
binding site (which, for example, can be tagged by a labeled 
version of the protein) or a region of DNA tending to kink 
(and which would, therefore, serve as a visual tag requiring 
no further manipulation) , just to name a few. Further, such 
15 vectors may include a nucleotide sequence to which a labeled 
nucleotide probe may hybridize via, for example, techniques 
such as those described, below, in Section 5.3 •2. 

Size standards may additionally facilitate the accurate 
measurement of the restriction fragments which are generated 
20 herein. Such standards may, for example, be engineered into 
mapping vectors such as those described above. Methods, such 
as the methylation of the mapping vector, can be utilized to 
ensure that the siing standards remain intact during 
restriction. Alternatively, sizing standards may comprise 
25 fluorescent beads of different sizes which exhibit a known 
level of fluorescence. 

The successful use of the optical mapping techiques of 
the invention is demonstrated in Figure 12, which illustrates 
three types of ordered restriction maps produced by optical 
3 0 mapping of the present invention. These maps are compared 
with published restriction maps. Additionally, Fig. 13A-F, 
shows selected corresponding processed fluorescence 
micrographs of different yeast chromosomal DNA molecules 
digested with the restriction enzyme Not I. These images 
35 clearly show progressive digestion by the appearance of 
growing gaps in the fixed molecules. From such data, the 
order of fragments can be determined by, for example. 
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inspection of time-lapse images obtained at every time 
interval, e.g. . 0.07-200s, or any range or value therein, 
e.g. , 1-30S. Agreement is expected between the optical 
(length or intensity) and the electrophoresis-based maps, to 
5 be, and has, indeed, been found to be excellent. The third 
type of restriction map combines length- and intensity- 
derived data; small restriction fragments can be sized by 
length, whereas intensity measurements can provide the 
remaining fragment sizes needed to complete the maps. 

10 

5,3.2. OPTICAL SEQUENCING 

The elongated, fixed single nucleic acid molecules of 
the invention can be utilized as part of methods designed to 

15 identify specific, known nucleotide sequences present on the 
fixed nucleic acid molecules. Such methods are referred to 
herein as "optical sequencing" methods. In part because 
these methods involve the analysis of naked nucleic acid 
molecules ( e.g. . ones which are not in a chromatin state) , 

20 optical sequencing is capable of providing a level of 
resolution not possible with chromatin-based detection 
schemes such as in situ hybridization. Optical sequencing 
methods, in general, comprise the specific hybridization of 
single stranded nucleic acid molecules to at least one 

25 nucleotide sequence present within the single elongated fix 
nucleic acid molecules of the invention in a manner whereby 
the position of the hybridized nucleic acid molecule can be 
imaged and, therefore, identified. Imaging can be performed 
using, for example, techniques such as those described, 

30 above, in Section 5.2. The position of the imaged 

hybridization product can be identified using, for example, 
the sizing techniques described, above, in Section 5.2. 

As discussed above, the optical sequencing technique 
comprises the hybridization of nucleic acid molecules to the 

35 elongated, fixed single nucleic acid molecules of the 
invention such that specific hybridization products are 
formed, in a manner which can be imaged, between at least a 
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portion of the elongated, fixed single nucleic acid molecules 
and the hybridizing nucleic acid molecules. Because the 
hybridization is based on sequence complementarity between 
the hybridizing nucleic acid molecule and at least a portion 
5 of the elongated single nucleic acid, imaging of the 
hybridization product, coupled with the precise sizing 
techniques described in Section 5.2, above, optical 
sequencing rapidly identifies a nucleic acid region according 
to its specific nucleotide sequence. 

10 The optical sequencing techniques described herein have 

a variety of important applications. First, such techniques 
can be used to generate complex physical maps, by, for 
example, facilitating the alignment of nucleic acid molecules 
with overlapping nucleotide sequences. 

15 Second, such techniques make it possible to rapidly 

identify and locate specific genes of interest. For example, 
in instances where at least a portion of the nucleotide 
sequence of a gene is known, optical sequencing techniques 
can rapidly locate the specific genomic position of the gene, 

20 and further, can rapidly identify cDNA molecules which 

contain sequences complementary to the nucleotide sequence. 
Further, such optical sequencing methods have numerous 
diagnostic applications, such as, for example, the rapid 
identification of nucleic acid molecules containing specific 

25 alleles, such as genetic disease-causing alleles. For 

example, single elongated, fixed nucleic acid molecules from 
one or more individuals can be hybridized with a single 
stranded nucleic acid molecule probe which is specific for 
( i.e.. , will specifically hybridize to) an allele of 

30 interest. Such an allele may, for example, be a disease- 
causing allele. A positive hybridization signal would 
indicate that the individual from whom the nucleic acid 
sample was taken contains the allele of interest. 

Alternatively, the single elongated fixed nucleic acid 

35 molecules may represent nucleic acid molecules which are 
specific for ( i.e. , , will specifically hybridize to) an 
allele of interest. In such an instance, a nucleic acid 
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sample can be obtained from an individual and hybridized to 
the elongated fixed nucleic acid molecules. The presence of 
specific hybridization products would indicate that the 
individual from whom the nucleic acid sample was obtained 
5 carries the allele of interest. In order for the nucleic 
acid sample: single molecule hybridization products to be 
imaged, the sample nucleic acid may be labelled via standard 
methods, e^, by PGR amplification in the presence of at 
least one labelled nucleotide. Alternatively, as described 
10 below, the hybridization product need not be labeled, but can 
be identified by the imaging of a site-specific restriction 
cleavage event within the hybridization product. Further, as 
described, below, the hybridization product can be identified 
via indirect labeling, by hybridization product -specif ic 
15 binding of a labeled compound to the product. 

Conditions under which the introduced nucleic acid 
molecules are hybridized to the elongated fixed single 
nucleic acid molecules of the invention must be stringent 
enough to yield only specific hybridization products. 
20 "Specific", as used in this context, refers to nucleotide 
sequence specificity, and a "specific hybridization product" 
refers to a stable nucleic acid complex which is formed 
between at least a portion of the elongated nucleic acid 
molecule and at least a portion of the introduced nucleic 
25 acid molecule which is complentary to the elongated nucleic 
acid molecule. The sequence complentarity between these two 
hybridizing portions of the nucleic acid molecules is at 
least about 80%, with at least 90% being preferred, and at 
least about 98-100% being most preferred. 
30 Hybridization conditions which can successfully yield 

I the specific hybridization products described above are well 

" known to those of skill in the art. First, apart from RecA- 

mediated methods (see below) , the fixed, elongated nucleic 
acid molecules must be denatured (made single stranded) such 
35 that hybridization to the introduced single stranded nucleic 
acid molecule is possible, by following standard denaturation 
protocols which are well known to those of skill in the art. 
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The specific hybridization products formed must be 
imaged in order to, first, identify that such products have 
formed, and, in some cases, to identify the postion along the 
elongated fixed nucleic acid molecule at which such 
5 hybridization products have formed. A variety of methods may 
be utilized for the imaging of the specific hybridization 
products formed during the optical sequencing techniques 
described herein. 

First, the nucleic acid molecules which are hybridized 

10 to the elongated, fixed single nucleic acid molecules of the 
invention can be labeled in a manner whereby the 
hybridization products they contribute to can be imaged. Any 
of a number of standard labeling techniques which are well 
known to those of skill in the art may be utilized. These 

15 include, but are not limited to, colorimetric, fluorescent, 
radioactive, biotin/streptavidin and chemiluminescent 
labeling techniques, with fluorescent labeling being 
preferred. In instances wherein the elongated, fixed nucleic 
acid molecules are colorimetrically or f luorescently stained, 

20 the labeled nucleic acid which hybridizes to the elongated 
molecule should be labeled in a manner which produces a 
different color or fluorescence than the stained elongated 
molecule. The labeled nucleic acid will generally be at 
least about 20 nucleotides in length, with about 100 to about 

25 150 nuelcotides being preferred. Specific hybridization 
products can be imaged by imaging the labeled nucleic acid 
within the hybridization product. 

Second, methods may be utilized which obviate the need 
to label the nucleic acid which hybridizes to the elongated, 

30 fixed single nucleic acid molecules of the invention. For 
example, optical sequencing methods may be used in 
conjunction with a technique known as the RecA-assisted 
restriction endonuclease (RARE) technique (Koob, M. et al., 
1990, Science 2^:271; Ferrin, L.J. et al . , 1991, Science 

35 254:1494; Koob, M. et al . , 1992, Nucleic Acids Res. 20:5831). 
Briefly, the RARE technique involves the generation of 
restriction endonuclease cleavage events that occur solely 
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within the specific hybridization product. By combining 
sequence-specific RARE methods with the ability to visualize 
the formation of restriction cleavage sites, as described for 
optical mapping, above, in Section 5.3.1, specific 
5 hybridization products can be detected without prior labeling 
of the nucleic acid being hybridizaed to the elongated, fixed 
single nucleic acid molecules of the invention. 

The RARE technique, more specifically, makes 
hybridization product -specif ic restriction possible by 
10 selectively blocking methylase, such as EcoRI methylase, 

enzymes from acting upon the specific hybridization products. 
Methylases are enzymes which methylate nucleic acid molecules 
in a sequence specific manner, and nucleic acid which has 
been methylated is no longer subject to restriction 
15 endonuclease action. For example. EcoRI methylase methylates 
nucleic acid molecules at the EcoRI recognition site such 
that EcoRI will no longer cut at that site. Once each of the 
restriction sites outside the site of specific hybridization 
are methylated, restriction digestion is performed. The only 
20 resulting cleavage sites are those within the region where 
specific hybridization had occurred, thereby identifying the 
position of such hybridization. 

RARE uses RecA protein to block methylase activity in a 
site specific manner. The RecA protein exhibits an ability 
25 to pair a nucleic acid molecule to its complementary, 
homologous sequence within duplex DNA such that a triple 
stranded nucleic acid/RecA complex is formed. Such a complex 
is protected from methylase activity. Thus, the introduction 
of a nucleic acid molecule which will hybridize to at least a 
30 portion of the elongated, fixed single nucleic acid molecules 
of the invention, together with a RecA protein (and necessary 
RecA cof actor reagents) under conditions, such as those 
described, above, which will yield specific hybridization 
products, generates a triple stranded complex at the site of 
35 such specific hybridization. After formation of such a 

triple helix complex, the nucleic acid molecule is methyated. 
After methylation, enzymes and introduced nucleic acid 
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molecules are removed, leaving duplex DNA which has been 
methylated at all positions except those within the site of 
hybridization. 

The triple stranded complex formation and and subsequent 
5 methylation steps can be performed either before or after the 
elongation/fixation of the single nucleic acid molecules. In 
instances wherein these steps are performed prior to 
elongation, care must be taken with large nucleic acid 
molecules to avoid shearing of the molecules. One method 

10 which can successfully avoid such shearing is to perform the 
steps in agarose gel blocks or "chops". "Chops" refer to 
agarose gel blocks containing nucleic acid molecules, in 
which the gel blocks have been cut into small pieces. When 
triple strand formation is performed in a gel composition, it 

15 is generally more efficient to combine the components in 
molten agarose rather than diffusion into a hardened gel. 

After removal of excess non- hybridized nucleic acid 
molecules and reagents, the nucleic acid molecules can be 
elongated and fixed according to the gel -based or solid- 

20 surfaced based techniques described, above, in Section 5.1. 

Elongated fixed single nucleic acid molecules which have 
been treated as above (either before or after elongation) are 
then subjected to restriction digestion with a restriction 
enzyme that cannot act upon ( i.e. , . cannot cleave) the 

25 methylated DNA. Restriction digestion and cleavage site 
visualization can be performed according to the methods 
described for optical mapping described above in Section 
5.3.1. The only cleavage sites which form are those within 
the site of specific hybridization. Sizing techniques such 

30 as those described, above, in Section 5.2, may be utilized to 
ascertain position. Such sizing techniques are not necessary 
in cases where the mere occurence of hybridization, rather 
than position, of hybridization is being assayed. 

Additionally, methods can be utilized which obviate both 

35 a need to image a cleavage site and the need to denature the 
elongated nucleic acid prior to hybridization. These 
techniques, especially in light of the fact that denaturation 
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is not necessary, allow for more extensive coupling of the 
optical sequencing techniques of this Section with the 
optical mapping methods of Section 5.3.1, above. 

In one embodiment of such an optical sequencing 
S technique, a modified RARE method is utilized. Such a 
modified RARE technique involves, as described above, the 
generation of a triple stranded nucleic acid/RecA complex. 
Because no subsequent restriction will take place, no 
methylation is necessary after the generation of the complex. 

10 In this version of the method, complex generation should take 
place after the elongation/fixation of the single nucleic 
acid molecule of interest. The nucleic acid molecule which 
hybridizes to the elongated fixed single nucleic acid 
molecule is labeled, as, for example, described, above, in 

15 this Section. Because RecA is being used to promote triple 
stranded complex formation, no prior denaturation of the 
elongated duplex DNA is necessary. Upon triple strand 
complex formation, the site of the specific hybridization is 
identified by imaging the labeled nucleic acid molecule with 

2 0 the complex. 

Further, techniques may be utilized which obviate both 
the need for the introduction of a labeled nucleic acid and 
the need to image a restriction cleavage site. Such 
techniques involve the binding of a labeled component to a 

25 siLe containing a specific nucleotide sequence, such as the 
site of a specific hybridization product such that this site 
is, in effect, indirectly labeled. The bound component is 
imaged, thereby identifying, first, that hybridization has 
taken place, and second, making possible the identification 

30 of the position of such hybridizaton . Techniques such as 
this may be especially useful, for example, in diagnostic 
instances wherein the nucleic acid which is introduced to 
hybridize to the elongated single nucleic acid molecules is 
scarce. Further, these techniques make unecessary a need for 

35 amplification of such scarce material prior to hybridization, 
thus avoiding potential amplification-generated artifacts. 
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In one embodiment of such a technique, a modified RARE 
procedure is followed. Specifically, a triple stranded/RecA 
protein complex is formed, as decribed above, however, in 
this case, the RecA which is utilized is labeled. Once 
5 again, because no restriction will take place, no methylation 
step is necessary. The RecA protein must be labeled in a 
manner which retains its activity while allowing for its 
imaging. Such techniques are well known to those of skill in 
the art, and may include, for example, addition of epitope 

10 tags, biotin, streptavidin, and the like. In instances 

wherein the elongated nucleic acid molecule is stained, it is 
important that the color or fluorescence generated by the 
labeled RecA protein is distinguishable from that of the 
stained nucleic acid molecule. Instead, therefore, of 

15 generating and imaging a restriction cleavage site, or the 
imaging of a labeled nucleic acid molecule, the site of 
specific hybridization is identified by merely imaging the 
bound RecA protein. 

In another embodiment of such a technique, the labeled 

20 component is a labeled compound, such as a protein, which 
binds nucleic acid in a nucleotide sequence-specific manner. 
By contacting the labeled protein to the elongated fixed 
single nucleic acid molecules of the invention, the presence 
and positon of such a binding protein could be identified. 

25 

5.3.3 DIRECTED OPTICAL MAPPTMP, 

The optical mapping and optical sequencing techniques 
described herein may be combined such that mapping may be 
performed in a directed fashion. Such technique is referred 

30 to herein as "directed optical mapping". Such techniques 
function to target specific portions of a genome for further 
high resolution mapping analysis. Specifically, single 
nucleic acid molecules which contain specific sequences of 
interest may be identified from among the total single 

35 nucleic acids present in a population of single nucleic acid 
molecules. Once the specific nucleic acid molecules 
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containing the sequences of interest are singled out, such 
nucleic acid molecules may be further analyzed. 

Such directed optical sequencing can serve a number of 
important applications, which include, but are not limited to 
5 diagnostic applications which can directly image, for 
specific loci, any genetic lesion which can be imaged via 
optical mapping. Additionally, fingerprints of specific 
genetic loci can rapidly be obtained for individuals or 
populations. 

10 A number of methods may be utilized to select the single 

nucleic acid molecules to be further analyzed. First, each 
of the nucleic acid molecules which may contain the specific 
sequences of interest can be elongated and fixed, utilizing 
techniques such as those described, above, in Section 5.2. 

15 Once elongated, the single nucleic acid molecules to be 
further analyzed can be identified by using the optical 
sequencing techniques described in Section 5.3.2. Finally, 
those single nucleic acid molecules which hybridize via 
optical sequencing, can be mapped at high resolution via, for 

20 example, the optical mapping techniques described in Section 
5.3.1. 

Alternatively, nucleic acid molecules which will 
hybridize to the sequences of interest can be elongated and 
fixed on the solid surfaces of the invention. Once mounted 

25 onto a surface, all nucleic acid molecules which may contain 
the sequences of interest are contacted with and hybridized 
to the nucleic acid molecules fixed on the surface. Those 
single nucleic acid molecules which contain sequences 
complentary to those fixed on the surface will become bound 

30 to the surface. Once bound to the complementary nucleic acid 
molecules, the entire single nucleic acid molecule which 
contains such hybridizing sequences will become fixed onto 
the surface. Thus, the nucleic acid molecules which are to 
be further analyzed are not only identified, but are 

35 additionally elongated and fixed in a manner which makes them 
amenable to the optical mapping techniques described, above, 
in Section 5.3.1. 
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5*4* HIGH THROUGHPUT OPTICAL MAPPING AMD SEQUENCING 
SYSTEMS AND METHODS 

The high throughput automated system and method of the 
present invention are based on the optical mapping and 
sequencing approaches described above and are capable of 
providing high speed, high resolution mapping and sequencing 
of PGR products, clones and YACs, requiring little or no 
input from human operators . 

Reliable, high speed molecular sizing is at the heart of 
any high throughput molecular analysis method. As defined in 
Section 5.2, there are two main sizing approaches dependent 
on whether or not the molecule being sized is stationary or 
not. High throughput analysis methods can be classified 
accordingly into static and dynamic. Static methods 
generally involve simple equipment and are thus more suitable 
for high throughput measurements at present. On the other 
hand, dynamic methods are typically more accurate but at 
present are less well adapted to high througrhput measurements 
because of the more sophisticated equipment they require. 

In accordance with the present invention, high 
throughput molecular analysis is performed using high speed 
processing of digitized images of stationary or dynamically 
perturbed molecules. Both approaches are considered next. 

5.4.1. STATIC MEASUREMENTS 

In accordance with a preferred embodiment of the present 
invention a novel system is developed for static molecular 
measurements using automated, high speed surface-based 
methods for molecular spotting and fixation. 

5.4.1.1. SPOTTING AND FIXATION 

As discussed in the preceding sections, desirable DNA 
fixation attributes include: a high degree of molecular 
extension, preservation of biochemical activity and 
reproducibility at high deposition rates. Furthermore, the 
development of high- throughput systems for genomic analysis 
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requires that the fixation approach provides high sample 
deposition rates, high gridded sample densities and 
simplified access to the arrayed samples. Inadequate 
attention to any of these fixation aspects is likely to 
5 unduly complicate the analysis and increase its cost. 

Accordingly, in a preferred embodiment of the present 
invention the spotting and fixation equipment of the system 
includes an automated Eppendorf Micro-Manipulator Model 5171, 
and Injector capable of depositing a large number of clone 
10 DNA molecules on a derivatized glass surface while 
maintaining molecular extension and biochemical 
accessibility. In particular, a small capillary tube (about 
100 microns) , or a blunt -ended glass rod are used to draw DNA 
samples and transfer them to the surface by contact as small 
15 droplets of DNA solution. The solution droplets can be mixed 
with a variety of dopants to produce different types of 
elongation conditions, as described above in Section 5.1. 

In particular, the solution droplets are spotted on the 
surface in ordered arrays with spacing and deposition 
20 conditions controlled by an electronic Ludl Mac 2000 

interface box connected to the computer. Spot diameter is 
controlled, for example, by adjusting the inner diameter of 
the capillary tube and ranges between about 40-1000 microns. 
Preferably, smaller-size, high grid density spots are used 
25 for optimal throughput. As clearly illustrated in Figs. 16 
(A,B and C) , smaller size spots seem to increase the 
efficiency of the fixation technique because of the 
relatively large number of molecules which are stretched on 
the periphery of each spot after it dries. In a specific 
30 embodiment of the present invention illustrated in Figs. 16 
A, B and C each spot is about 100 microns in diameter, the 
variation between spots being about +/- 20 microns. The 
center- to- center spacing between adjacent spots is on the 
order of 150 microns, but smaller or larger spacings can also 
35 be used, if desired. The deposition of spots is controlled 
by computer program settings of the Micro-Manipulator and a 
x-y table connected to microstepped motors. Typical 
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deposition rate for this equipment is one spot in less than 
about every 2 seconds . 

In another preferred embodiment of the present invention 
a very large number of clones can be deposited on a 
5 derivatized surface using a Beckman Biomek 2000 robot 
programmed for sample spotting. The use of the robot 
completely obviates the need for human intervention and 
results in approximately ten times faster deposition rates 
compared to standard deposition techniques. Furthermore, the 
10 robot -aided fixation approach can result in reliable 

deposition of very closely spaced DNA samples {20 microns 
with spot-to-spot spacing of about 35 microns) which further 
improves the efficiency of the method. 

In yet another preferred embodiment of the present 
15 invention, a vision controlled pick-and-place robot, 

manufactured, for example, by Research Genetics or Sci-Tech 
(Switzerland) can be used to completely automate the spotting 
process by selecting objects randomly distributed on a plane 
surface and spotting them in a controlled manner on the 
20 derivatized surface. 

As discussed above, although small drops of DNA solution 
can easily be deposited onto derivatized surfaces, these 
molecules (below 40 kb) are not elongated in solution and 
thus cannot be optically mapped. In accordance with the 
25 present invention, any one of three different approaches may 
be used to spread and fix the spotted DNA. In a specific 
embodiment of the invention, spotted DNA molecules can be 
"sandwiched" with a coverslip between two glass surfaces 
which, when pressed together, stretch the DNA molecules 
30 positioned in between. This approach gives acceptable 

mapping results; however, it is serial in nature so that only 
one sample can be measured at a time and is thus not 
effective for high- throughput processing. 

In a second embodiment of the present invention, the 
35 spotted glass surface is rehydrated, after which a teflon 
block stamp is pressed onto the DNA spots, causing them to 
spread and fix on the sticky, derivatized surface. 
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Experimental results indicate that this approach is effective 
for elongating surface mounted DNA without significant 
breakage. Figure 29 shows an enlarged view of a DNA spot and 
the use of a teflon block in accordance with this embodiment 
5 of the present invention to spread the molecules onto the 
derivatized surface. 

In a third; preferred method of the present invention, 
the deposited droplets of DNA solution are simply let dry on 
the derivatized surface. Experiments show that as the 

10 droplets dry most of the fixed DNA remains fully elongated, 
aligned, and primarily deposited within the spot peripheiry in 
a characteristic "sunburst" pattern, clearly observable in 
Figs. 26 A, B and C. Addition of glycerol to the spotting 
solution results in well elongated DNA molecules which are 

15 more uniformly distributed. As discussed above, in practice 
the rehydration of spotted DNA samples with restriction 
endonuclease buffer effectively restores the biochemical 
activity of the spotted molecules. The sunburst fixation 
pattern of elongated molecules in accordance with the present 

20 invention is a completely unexpected discovery which provides 
the basis for novel high throughput analysis methods, and has 
implications which are impossible to predict at this time. 

Following spotting and fixation, in accordance with a 
preferred embodiment of the present invention surf ace- fixed 

25 molecules are next digested by adding 20-40 ^1 of Ix 

commercial restriction buffer (manufacturer recommended) 
containing about 10-20 units of the corresponding restriction 
endonuclease per spotted coverslip; surfaces are then 
incubated in a humidified chamber for about 5-20 minutes. 

30 After digestion the overlaying buffer is removed by washing 
in a beaker of TE (10 mM Tris-Cl, 0 . 1 mM EDTA, pH 7.4) 
buffer. Excess TE buffer is removed with an aspirator. The 
surface is then stained, for example, with YOYO-1 (lOOnM) 
fluorochrome provided by Molecular Probes, and sealed with 

35 immersion oil to prevent drying. Preferably, Cargille 

Immersion Oil for Microscopy can be used to this end. The 
surface-fixed molecules are then ready for optical mapping. 
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Figure 30 illustrates in a block diagram form the method 
of the present invention for high throughput optical mapping 
of lambda or cosmid clones. The figure illustrates the 
sequence of steps of robot -aided spotting of clones onto a 
5 rectangular derivatized glass plate 100; adding restriction 
enzyme; and image processing analysis in computer 200 after 
digestion. 

5.4.1.1.1. Processing of Gridded YAC DNA 

10 Figure 31 is a simplified block diagram of the system of 

the present invention used for high throughput optical 
mapping of gridded YAC DNA. The system in Fig. 31 is an 
adaptation of a clone spotting system for YAC analysis and 
restriction mapping that readily interfaces with existing 

15 automated equipment, yet is useful in laboratories lacking 
sophisticated sample handling technologies. 

In particular, Fig. 31 shows a method for spotting YACs 
as intact chromosomal DNA molecules prepared in microtiter 
plates 100. In a specific embodiment, yeast chromosomal DNA 

20 prepared in agarose can be used. As seen in the figure, 

single droplets of molten-agarose are spotted onto a coated 
surface 110, such as polylysine-, or APTES-coated glass. 
Experimental results indicate that approximately 30-75% of 
dropped molecules on the surface show little breakage, even 

25 for megabased-sized molecules. Restriction enzyme is then 
added, and digestion proceeds for a pre-defined period. 
Finally, a high-contrast f luorochrome, such as ethidium 
homodimer, is added and only imaged molecules fixed on the 
surface are taken for analysis. Note that f luorochrome is 

30 added after fixation and digestion, thus avoiding possible 
f luorochrome-restriction enzyme conflicts. Imaging is also 
done post-digestion. 

In an alternative embodiment, surface mounted DNA can be 
analyzed for map formation by adding restriction enzyme to 

35 the yeast chromosomal DNA molecules in the microtitre plate. 
Products are then analyzed after mounting. Analysis 
techniques in this case would include first end-labeling 
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YACs. Again, the high throughput method of the prsent 
invention employs imaging restriction digestion products post 
digestion. 

5 5.4.1.1,2. Spotting Intact YAC DNA 

Experimental data shows that YAC- sized DNA molecules 
suspended in molten agarose can be elongated and fixed when 
gridded onto the surface as small droplets. With reference 
to Fig. 31, the mounting procedure can be described as 
10 follows: a small amount of DNA embedded in agarose is dropped 
onto a treated surface 110. After this step the droplet 
flows, DNA molecules stick to the surface and elongate. This 
technique is similar to spreading procedures used for 
karyotyping mammalian cells. 
15 To increase the throughput and accuracy of the method, 

in accordance with a specific embodiment of this . invention, 
it is proposed to minimize breakage and optimize molecular 
elongation distributions. Ideally, it would be desirable to 
have all molecules perfectly positioned on a surface, 
20 completely biochemically active, and elongated by the same 
factor. This is a stringent set of specifications that in 
practice does not have to be met because given a sufficient 
sample size simple image processing routines can accommodate 
less than perfect data. 
25 Specifically, in accordance with a preferred embodiment, 

DNA concentration is varied systematically, changing 
pipetting variables (orifice size, delivery time, etc.), gel 
concentration, surface conditions (polylysine composition, 
and other compounds such as APTES) , the temperature of 
30 surfaces and fluids and others. Coated glass surfaces are 
scored in a defined direction to provide sticky grooves for 
molecules to adhere to. The analysis next uses fluorescence 
microscopy to measure the number of fixed molecules, the 
distributions of apparent molecular lengths and the 
35 biochemical activities. Biochemical activity is assayed by 
measuring the restriction digestion activity of previously 
mapped molecules bound on the surface. Importantly, this 
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work does not involve molecules above the surface, trapped in 
the agarose gel layer. 

For high throughput mapping it is important to have 
densly positioned spots dropped onto the surface. The 
5 Eppendorf Micro-Manipulator/Injector instrument described 
above is adequate for this purpose. The injector unit of the 
device, which is interfaced with the Micro-Manipulator, 
provides a very reproducible pipetting rate, as well as 
pipette filling time. In this specific measurement it is not 
10 possible to perform high- volume gridding, although 

micromanipulation is reproducible down to a fraction of a 
micron. 

In addition, molecular densities in the spots must be 
optimized in order to maximize the number of molecules imaged 

15 in a field without having significant overlap and crowding. 
It can be recognized that crowding complicates the 
recognition of individual molecules or fragments in the 
subsequent analysis. Optimal mount conditions for high 
throughput processing also depend on a number of factors, 

20 molecular size being a major one. Approximately 5-10 500 kb 
molecules can be imaged simultaneously using a lOOx objective 
and the camera/digitizing system. If the restriction 
digestion efficiencies are approximately 20% (for full 
digestion) , accurate maps can be created from about 50 to 100 

25 molecules. This means that approximately 5-20 fields are 
necessary to produce one to ten fully digested, scorable 
molecules. In terms of space, this translates to a maximum 
of about (0.5 mm)^per spot.. Accordingly, about 400 spots can 
be placed on a (2 cm) ^ coverslip, assuming a 1 mm center-to- 

30 center spacing between spots. A final consideration is 

preventing agarose spots from drying out during the gridding 
process. Possible solutions to this problem include 
performing gridding under high humidity, adding glycerol to 
the agarose and pipetting through buffer covered with a layer 

35 of a light hydrocarbon. 

Surface mounted molecules obtained using the methods 
described above provide sharp, high contrast images. These 
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high contrast images are in fact nearly ideal for creating 
binary images because they require little or no processing 
outside of ordinary shading correction operations and are 
simple for the computer to interpret. Therefore, such binary 
5 images are a starting point for most automatic image 

processing routines. Simple automatic imaging routines can 
then be used to discriminate a variety of individual DNAs, 
For example, optical mapping techniques can be used to size 
resulting restriction fragments by measuring fluorescence 

10 intensities and molecular contour lengths, as described in 
Section 5,2. A problem when using this approach for high 
throughput measurements is the recognition of molecules 
useful for analysis. In accordance with the present 
invention a solution to this problem is to automatically 

15 create "masks" from the binary images of mounted DNA 

molecules. These masks can then be used to guide optical 
mapping programs which recognize molecular fragments, size 
them and create maps . 

Furthermore, molecules can be tagged and discriminated 

20 by changing fluorescence microscopy filter packs using a 
computer-controlled filter wheel. Naturally, in such case 
tags are designed with spectral characteristics that differ 
from that of the bulk-stained molecule. Other molecular 
tagging approaches that are compatible with optical mapping 

25 can also be used. 

5.4.1.2. IMAGE PROCESSING EQUIPMENT 
In a preferred embodiment of the present invention 
imaging of the pretreated surf ace- fixed molecules is 

30 performed using a Zeiss Axioplan or Axiovert 135 microscope 
equipped for epi- fluorescence (filter pack for green 
excitation and red emission, or preferably a YOYO filter 
pack, 490 nm excitation, 510 nm emission) and Plan-Neof luar 
objectives (16x, lOOx; made by Zeiss) . The microscope is 

35 coupled to a Hamamatsu C2400 SIT focusing camera and an 

imaging Photometries PXL Cooled CCD camera. In a preferred 
embodiment, the spatial resolution of the image processing 
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equipment is 1032x1316 pixels per image, with 12 bits/pel raw 
image gray- level resolution and 16 bits operating resolution. 

In a preferred automated embodiment of the present 
invention the output of the focusing SIT camera is used in a 
5 feedback loop for auto-focus control and positioning to 
adjust the x-y position of the spot being imaged. The 
electronic auto- focus unit and a stepping motor unit 
connected to the microscope focus control are provided by 
Ludl Electronics and are known in the art. This system acts 

10 as an automated microscope capable of automatically moving 
from one imaged spot to another. 

In a preferred embodiment of the present invention, the 
processing computer is a SUN workstation with 128 MB RAM and 
32 GB hard drive space enabling continuous processing of 

15 large volumes of image data. In an alternative embodiment, 
the digitized images are stored for subsequent processing 
using a Macintosh computer. In this embodiment, a modified 
version of the commercial software package Ip Lab distributed 
from Signal Analytic, or a modified version of the NIH 

20 commercial software for Macintosh computers can be used. 

Figure 32 is a block diagram of another embodiment of a 
system for optical mapping in accordance with the present 
invention which preferably includes a cooled CCD camera 
(Photometries, AZ) . While the equipment in Fig. 32 is less 

25 suitable for high throughput measurements than the one 

described above, it can be used in certain applications which 
require the use of fluorescent lifetime microscope, whenever 
it is necessary to distinguish life molecules. 

Specifically, in Fig. 32, microscope 20 is used to image 

30 sample 10 which is placed on computer controlled x-y table. 
Illumination for the microscope is provided by illumination 
source 3 0 which can be a mercury lamp or, in a preferred 
embodiment, a laser source. Computer 40 is connected to 
controller 50 which controls the operation gate pulser 60. 

35 In this embodiment of the present invention gate pulser 60 is 
connected to illumination source 30 and triggers a 
illumination pulse which results in a fluorescence emissions 
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from the sample 40. These emissions are collected by 
microscope 20 and read out by ICCD camera 70 synchronously 
under the control of a gate pulse from gate pulser 60. 

5 5.4.1.3. FLUORESCENT LIFETIME IMAGING 

Figure 33 illustrates a method of optimizing the image 
collection process and maximizing the signal-to-noise ratio 
in accordance with the embodiment of the present invention 
which is illustrated in Fig. 32. The method is based on 

10 limiting the interval during which the camera can collect and 
record images to a time slot when the intensity of the 
illumination source has gone down to zero, as to eliminate 
stray light and scattering from this source. 

As shown in Fig. 33, the heart of the imaging 

15 fluorescence lifetime microscope is the coiled image 

intensified charge coupled device (ICCD) . This low noise 
device can image under light conditions that approach single 
photon counting levels. The ICCD signal-to-noise ratio is at 
least twice as good as a frame averaged SIT camera. 

20 Furthermore, the ICCD is also gatable down to 5 ns, which 
gate is shorter than most fluorescent probe lifetimes. The 
intensification stage on this camera consists of a 
microchannel plate, which functions like a bundle of 
photomultiplier tubes, so that a small number of photons 

25 trigger an avalanche of electrons that hit a phosphor screen 
and produce a bright image. The phosphor screen image is 
sensed by a CCD chip attached to the intensifier by a fiber 
optic coupler, and the chip-born image is transferred into 
the camera controller and digitized. As mentioned, the 

30 intensifier is gated so it can be opened and closed, just 

like a camera shutter. This "shutter", however, is very fast 
and has a gating ratio of better than 5x10^:1. In accordance 
with the present invention the ICCD is a preferred imaging 
system for quantitative work using fluorescent lifetime 

35 microscopy. 

In order to maximize the signal-to-noise ratio of the 
system, the gating feature of the ICCD is used to open the 
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shutter only after the excitation pulse is finished, thus 
substantially eliminating stray light and scattering from the 
illumination source. Hence, the emission photons for image 
collection are generated exclusively from fluorescence under 
5 controlled timing, which enables the system to distinguish 
bound from unbound emissions and stray fluorescence on the 
basis of fluorescence lifetimes. 

As a non- limiting example of the use of the system, for 
an ethidium bromide-DNA complex, the dye lasers are tuned to 

10 525 nm, and the gate widths are set to 63 ns which is about 
three times the lifetime of the bound species. The lifetime 
of unbound ethidium bromide fluorescence in water is only 
about 1.6 ns, so that free fluorochrome emission closely 
follows the excitation profile and can be excluded. 

15 Emissions from other sources of background fluorescence, such 
as the immersion oil, glass slides and sample impurities can 
also be attenuated using this technique. 

In accordance with the present invention gated pulses 
can be timed and synchronized with fluorescence decay. In 

20 particular, the gating pulser is timed to produce a high 
voltage signal during the fluorescence lifetime of the 
fluorochrome -DNA complex. The high voltage pulse opens and 
closes the electronic shutter. Illumination is pulsed with a 
8 ns FWHM duration, so that excitation is present only when 

25 the shutter is closed. Eliminating filters using this 
approach results in increasing the light throughput and 
removing sources of unwanted fluorescence. In a specific 
embodiment the laser excitation repetition rate is variable 
(1-100 Hz) , and the fluorescence emissions accumulate as 

30 charge on the ICCD head; a resultant image builds up 

consisting of bright spots with intensities proportional to 
molecular mass. Two nanosecond lasers, such as the Continuum 
Corporation Nd-YAG pumped TiSaphire tunable solid state laser 
and the Lambda Physik excimer pumped dye laser, are 

35 appropriate for use with these method. 

The sensitivity and size resolution of the system can be 
evaluated using EcoRI digests of lambda bacteriophage DNA 
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stained with ethidium bromide. In particular, images are 
generated and the spot intensities corresponding to single 
molecules are tabulated by image processing routines. These 
intensities are subsequently used to obtain histograms 
5 depicting intensity populations which correspond to fragment 
size populations. The precision and accuracy of the 
measurements can be calculated and used to set proper bin 
widths for the histogram analysis. This sort of analysis can 
be done on DNA molecules flowing through a synthetic silicon 
10 matrix. 

In accordance with this approach DNA fragments are 
preferably in optimal focus because out-of-focus fragments 
have intensity values that can vary for same sized molecules. 
To ensure that molecules are in focus, one can use the 

15 surface mounting techniques described herein. Other methods 
can also include the use of centrifugal forces to spread DNA 
fragments in solution or gel out on a glass surface. 

In accordance with the present invention non-uniform 
illumination can be corrected by a combination of careful 

20 illumination adjustments and by use of processing routines 
developed for relative intensity measurements in optical 
mapping. Essentially, this routine locates local surrounding 
pixels and uses their intensity values to calculate local 
background values. As described in more detail next, local 

25 background values compensate for uneven illumination and act 
as shading correction. 

To improve the resolution of the method different 
f luorochromes can be used, e.g. . having varying degrees of 
sequence specificity and, if appropriate, f luorochromes with 

30 complementary sequence biases, such as ethidium homodimer and 
ethidium-acridine orange heterodimer. The image contrast can 
be improved further by eliminating unbound f luorochrome . For 
example, ethidium monoazide (Molecular Probes, Inc.) is a 
fluorochrome that covalently attaches to DNA in high yield by 

35 photochemical means, and unbound compound can be readily 
extracted from the labeled DNA before mounting. 
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As indicated above in Section 5.2.2., a series of well- 
defined DNA fragments can be added to the sample as internal 
fluorescent size standards. The concentration of 
fluorescence intensity standards is adjusted so that they are 
5 readily identifiable in any histogram analysis. 

Fluorescence lifetime microscopes can also be used to 
improve intensity based sizing for larger fragments (50-1,000 
kb) or 1-1,000,000 kb. The results of the above sizing 
analysis obtained for a restriction digest of a pure sample 

10 can be an optical fingerprint and analogous to a fingerprint 
(without the hybridization step) derived from gel 
electrophoretic methods. Ancillary methods can use this 
advanced sizing methodology to produce ordered maps from 
genomic DNA and YACs of particular individuals or populations 

15 or subpopulations at high speed. 

5.4,2. IMAGE PROCESSING 

Figure 34 is a block diagram of the high throughput 
image processing method in accordance with a preferred 

20 embodiment of the present invention. 

Specifically, step Al of the method is a flat field 
correction of the raw image. The flat field correction is 
used to provide an image in which pixel values are 
proportional to the amount of dye present at each pixel 

25 location of the sample plane. This operation is typically 
required in cases when the illumination is not uniform over 
the entire field of view. It may also be used to eliminate 
the effects of imperfect image filters which may cause 
visible beat patterns similar to the Moire effect at the 

30 sampling frequencies of the system. The correction is based 
on the assumptions that the emitted fluorescence depends 
linearly on the amount of illumination in the field of view 
and that the camera response is linear. 

Two auxiliary images are used cc perform the flat field 

35 correction: a dark image (no input signal from the field of 
view) and the image of interest (an illumination image) . 
Both images should be collected under identical conditions 
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with no saturation of the video signal in which case the 
gray-level histogram of both images is distributed normally. 
Next, the dark image is subtracted on a pixel -by-pixel basis 
from the illumination image to generate a difference signal 
5 which is proportional to the level of illumination at the 
corresponding pixel of the image generated from the light 
striking the camera. The resulting difference signal at each 
pixel is then normalized by the value of the corresponding 
pixel in the illumination image to generate an image in which 
10 pixel values are proportional to the amount of dye in the 
sample . 

The second step A2 of the image processing method in 
accordance with the present invention is to generate binary 
images which roughly correspond to and thus identify the 
15 contours of the desired molecule fragments. In the system of 
the present invention thresholding is automated on the basis 
of constructing a histogram of the image and setting the 
threshold level for binarization equal to the computed 
midpoint between gray levels corresponding to background (no 
20 light) pixels and gray levels corresponding to foreground 
(the molecular fragments) . This step is well known in the 
art and will not be considered in further detail. in a 
preferred embodiment of the invention, the step of generating 
binary images is preceded by a filtering operation designed 
25 to re-.ove spot noise or other artifacts that may affect the 
accuracy of the method. Preferably, such spot noise can be 
eliminated by the use of a 2-D median filter of size 5x5 or 
7x7 , as known in the art . 

In step A3 of the method the imaged molecules are 
30 segmented on the basis of the thresholded images. 

Morphological operation of this type were described in some 
detail in Section 5.2. In a specific embodiment of the 
present invention using NIH image processing routines, a seed 
fragment is selected first by pointing near a desired 
35 fragment. An overlay image of the selected portion of the 
image field is next presented after a four-time dilation 
using pixel replication. Background correction may be used 
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prior to the step of segmentation to reduce the effects of 
unbound dye or imperfect emission filters. In this 
processing step the average background pixel value is simply 
subtracted from each pixel of the image. 
5 In a preferred embodiment of the present invention, 

segmentation of the image, including for example the 
computation of the medial axis of the imaged molecule, and 
the definition and storage of connectivity information is 
done automatically. 

10 In one embodiment of the present invention the 

segmentation step A3 is complete with the identification of 
the DNA fragments. In a second, preferred embodiment of the 
present invention, the identification of fragments is 
followed by the step of boundary extraction and edge linking 

15 as part of a computer routine connecting molecule fragments 
into complete reconstructed molecules. 

As shown in Fig. 34, the last step of the high 
throughput image processing involves sizing of the molecules 
which have been imaged followed by optical mapping, or 

20 possibly optical sequencing, as described in more detail 
next . 



5.4.2.1. OPTICAL MAPPING 

In accordance with a preferred embodiment of the present 
25 invention, high throughput optical mapping is used to 
generate clone maps. The method comprises the following 
steps : 

(1) Imaging the molecules to obtain digital images of 
the clones being analyzed; 

(2) Using relative fluorescent intensity or contour 
length measurements to create maps from individual molecules. 
This involves computation of the relative sizes of individual 
fragments, as described in Section 5.2. above; 

(3) Creating a histogram of all measured molecules 
35 according to the number of cuts detected. As shown for 

example in Fig. 7 the created histogram indicates the number 
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of molecules having a specified number of cuts following 
digestion; 

(4) For each histogram bin which corresponds to a 
specific number of cuts, using statistical analysis of the 

5 maps created in step (2) to obtain information about the 
clustering consistency of the cuts. The consistency 
measurement is determined by computing the pooled standard 
deviation within molecules of a single histogram bin. The 
consistency analysis is based on programs which minimize the 

10 Euclidean distance between members of a single cluster, and 
maximize the distance from other measurement clusters. This 
step of the method can, for example, be performed using a 
commercial statistics routine, such as Systat. (Thus, for 
example, in accordance with the present invention all 

IS molecules determined to have a specified number of cuts are 
examined to determine the consistency of the spacial position 
of the cuts) ; 

(5) In accordance with the present invention, the 
histogram bin which has the highest consistency (i.e.,, the 

20 lowest pool standard deviation) and the largest number of 
fragments is selected for further analysis purposes. 
Specifically, all individual fragment sizes within the 
selected bin are averaged to obtain the estimate of the 
desired ordered map. As indicated in Section 5.2.3. above, 

25 using sample averaging increases the measurement accuracy in 
proportion to the square root of the number of measurements; 

(6) Finally, maps can be aligned ( i .e . . . by placing the 
largest fragment to the left) to generate the desired ordered 
map . 

30 The proposed optical mapping approach in accordance with 

a preferred embodiment of the present invention is simple to 
implement and can thus easily be automated. Furthermore, due 
to the fact that a large number of measurements can be made 
in parallel, the method can provide very high throughput and 

35 also because of its high accuracy, is expected to provide an 
extremely valuable analysis tool for all kinds of practical 
applications . 



- 95 - 



wo 96/31522 



PCT/US96/04550 



5.4.2.2. DE NOVO SEQUENCING 

Optical sequencing, described in Section 5.3. above, is 
a genomic analysis technique which is likely to become 
especially important when used in connection with high speed 
5 optical measurements. In addition, such high speed 
measurements may be utilized in connection with de novo 
sequencing techniques. For example, such measurements may be 
used to quickly analyze the size distribution of a Sanger 
dideoxy sequencing ladder. Specifically, trimmed molecules 

10 are labeled or stained according to standard techniques, 
mounted on the microscope stage and sized using the 
rotational diffusion methods described above. 

The Sanger sequencing technology provides the ideal 
substrates (i^. , stiff, rod-like duplex DNA molecules) for 

15 determining rotational diffusion coefficients. Since the 

dependence of rotational diffusion coefficients on length^ has 
been experimentally determined for molecules in this size 
range (50- 500 bp) , a resolution of one base pair difference 
in size can be achieved. For example a 200 vs. a 199 bp 

20 molecule will show a relative rotational diffusion 

coefficient ratio of 1.025; while a 100 vs. 99 bp molecule 
will exhibit a ratio of 1.0360. Thus, even when dealing with 
moderately long polynucleotides, an adequate level of 
resolving power still exists. Furthermore, the data measured 

25 can be expected to be very accurate despite some errors in 
measurement, since the determined length varies as the time^^^ 
measured. 

Despite the extraordinarily high molar chromophore 
concentration contained within a small rod of DNA, the total 

30 number of chromoforms, and, therefore, the total fluorescence 
is low. Sufficient sensitivity can be achieved by, for 
example, utilizing a microchannel plate detector that can 
detect single photons. Further, connecting the microchannel 
plate detector to a sensitive SIT camera, and averaging using 

35 image processing techniques, proper data can be obtained. 

Primer length must also be optimized in order to achieve 
maximum size differentiation. Specif icaly, the primer must 
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be long enough to carry sufficient chromophore for detection 
of the smallest molecule in the ladder but not so long as to 
show random coil behavior. Preferably, the minimum primer 
length which can detect the smallest molecule in the ladder 
5 should be utilized. 

For performing such de novo optical sequencing, a group 
of discrete molecules is used and analyzed. Size population 
histograms are obtained from the analysis and statistical 
analysis is used to fully characterize a given sequencing 

10 ladder. To increase the throughput of the system, the image 
processing equipment can measure many objects in parallel. 
Since the measured molecules are small, it is possible to 
image intensity changes of thousands of molecules 
simultaneously. 

15 The de novo optical sequencing data rate is, in 

principle, many times faster than gel -based methods. It is 
estimated that with millisecond relaxation times and multiple 
alignment/size determinations lasting 30 cycles/sequence and 
fast computers, a 300 base pair ladder can be sized in 120 

20 seconds, assuming 4 reactions per sequence, yielding a final 
rate of 9,000 bp/hour. This rate is approximately fifteen 
times faster than the automated sequencer rate presented in 
the National Academy report on mapping and sequencing the 
human genome . 

25 

5.4.3. DYNAMIC MEASUREMENTS 

In accordance with a specific embodiment of the present 
invention, OCM dynamic molecule sizing, as described in 

30 Section 5.2.2.2.1., can be modified to provide high 

throughput methodology by using a new physical effect to 
elongate molecules and new image processing methods to 
measure molecular lengths in real time. 

Specifically, in accordance with the proposed method, 

35 fluid-gel interfaces provide optimal conditions for 

differential frictional forces to act on an electrophoresing 
molecule and elongate it to nearly its full contour length 
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which can be imaged. The net elongation force on the 
molecule is determined by the difference between the value of 
the DNA frictional coefficient in the gel matrix versus its 
value the fluid phase. As known, when a DNA molecule 
5 electrophoreses -through a gel -fluid interface the fluid 

frictional forces are much less than those present in the gel 
matrix. Typically these forces are at least tenfold less, 
but differences can vary with gel concentration. Within the 
gel matrix molecular conformation is dynamic but on the 
10 average it is relatively compact. When a molecule emerges 
from the gel matrix into the free solution frictional forces 
are reduced, thus applying a differential force across the 
molecule and causing it to elongate. Immediately after a 
molecule completely pulls free of the matrix, elongation 
15 forces disappear and the molecule relaxes back to a compact, 
free solution conformation. Reversing the electrical field 
sends the free molecule back into the gel matrix. In 
accordance with the present invention this process can be 
imaged by taking a series of digital images. Next, the 
20 apparent length of the molecule can be measured as it is 

elongated across the boundary between the gel matrix and the 
fluid; measurements can then be averaged as many times as 
needed depending upon the desired accuracy. 

In another embodiment of the present invention, high 
25 throughput relaxation time measurements are performed by 
elect rophoresing molecules through the gel -fluid interface 
described above, and estimating the molecular relaxation by 
measuring the optical length of the molecule at periodic 
intervals to quantitate the degree of relaxation. As 
30 discussed in Section 5.2, the changes of the apparent 

molecular lengths as a function of time can be fitted to a 
single exponential decay function to obtain the relaxation 
time . 

In this embodiment, solution relaxation mechanisms are 
35 somewhat different than gel -based ones, in that coil segments 
are not confined to move within a tube, or a series of 
connected gel pores. Rather, in a free solution, elongated 
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DNA molecules relax by evolving from a drawn-out prolate 
ellipsoid to a more symmetric, spherical conformation. 
Relaxation times are also shorter in free solution (generally 
10-fold less) : for example, a 500 kb molecule has a 
5 relaxation time of 4 seconds. However, since solution 

relaxation times are inversely proportional to the solution 
viscosity, measurements on small molecules can be made on a 
convenient time scale by simply adding glycerol or sucrose to 
increase the viscosity of the solution. It is significant to 

10 note that the shorter relaxation times manifested by solution 
based relaxation measurements are advantageous for any high 
throughput approach because they can enable automated 
collection of images at regular intervals which can then be 
used to determine automatically the desired relaxation times. 

15 In a specific embodiment, the high throughput dynamic 

size measurement techniques of the present invention can be 
performed by electrophoresing molecules through the interface 
at a rate of approximately 20-50 molecules/minute. Contour 
lengths can be measured and tabulated from stored data by the 

20 same techniques and computer algorithms developed for optical 
mapping and coil relaxation measurements. Images, such as 
those obtained from a SIT camera are rapidly digitized, frame 
averaged and stored as 16 bit resolution files at a rate of 
30 frames/sec. In a specific example, 120 file frame buffers 

25 can be used in the analyzing computer. This means that 120, 
512x512 pixel images can be digitized and stored in about 4 
seconds. More rapid image storage is available by simply 
reducing the image size, in which case the same hardware can 
store 480 128x128 pixel images. Processing algorithms can 

30 thus size 5-10 molecules simultaneously by gathering 

approximately ten images (by averaging 4-16 frames together) 
in a 20 second interval. One gigabyte hard disk provides 
storage space for close to 2,000 full frame images or sizing 
data for 1,000-2,000 molecules. Processing algorithms can be 

35 set up to work in a batch mode and require approximately 3-5 
hours to process one gigabyte worth of image data into 1,000- 
2,000 sizes tabulated on a spreadsheet. These processing 

- 99 - 



wo 96/31522 



PCT/US96/04550 



times are based on unattended operation, but operator 
interfaces can also be used that permit convenient manual 
identification and marking of molecules for analysis. 

Fluorescence images of DNAs obtained in fluid are 
5 generally brighter, sharper and relatively free of 

fluorescing artifacts compared to those obtained in gel. 
Consequently, these images are preferred for unattended image 
processing because they can be transformed reliably into 
digital or binary images. This sizing methodology can be 

10 tested and benchmarked by using a series of Not I digested 
yeast chromosomes mixtures (containing DNAs 30-900 kb) , of 
increasing complexity. Statistical analysis can be performed 
to calculate the precision of single measurements and to 
determine the accuracy of the used methodology. Confidence 

15 intervals can then be determined to establish the minimum 
number of molecules necessary for adequate analysis of 
complex mixtures. This analysis helps determine the usable 
size resolution and size discrimination levels for the 
particular equipment and method being used. Sources of noise 

20 and systematic error should be detected and eliminated as 
much as possible. A lower size limit of 5-20 kb and an 
increased upper size limit are provided in accordance with 
the present invention due tothe fact that molecules with 
contour lengths greater than the microscope viewing field are 

25 sized by offsetting a known distance from the interface and 
by monitoring only coil ends. 



EXAMPLES 

30 The following examples are offered in order to more 

fully illustrate the invention, but are not to be construed 
as limiting the scope thereof. 

EXAMPLE 1. Preparing DNA for Microscopy 

35 G bacteria was grown as described by Fangman, W.L. , 

Nucl. Acids Res., 5, 653-665 (1978), and DNA was prepared by 
lysing the intact virus in 1/2 X TBE buffer (IX: 85 mM 



- 100 - 



wo 96/31522 



PCTAJS96/04550 



Trizma Base (Sigma Chemical Co., St. Louis MO), 89 mM boric 
acid and 2.5 mM disodium EDTA) followed by ethanol 
precipitation; this step did not shear the DNA as judged by 
pulsed electrophoresis and microscopic analysis. 
5 DNA solutions (0.1 microgram/microliter in 1/2 X TBE) 

were diluted (approximately 0.1-0.2 nanogram/al agarose) with 
1.0% low gelling temperature agarose (Sea Plague, FMC Corp., 
Rockport ME) in 1/2 X TBE, 0.3 micrograms/ml DAPI (Sigma 
Chemical Co.), 1.0% 2-mercaptoethanol and held at 65«C. All 
10 materials except the DNA were passed through a 0.2 micron 
filter to reduce fluorescent debris. Any possible DNA 
melting due to experimental conditions was checked using 
pulsed electrophoresis analysis and found not to be a 
problem. 

15 

EXAMPLE 2. Imaging DNA in a Gel 

The sample of Example 1 was placed on a microscope 
slide. To mount the sample, approximately 3 microliters of 
the DNA-agarose mixture were carefully transferred to a 

20 preheated slide and cover slip using a pipetteman and pipette 
tips with the ends cut off to reduce shear. Prepared slides 
were placed in a miniature pulsed electrophoresis apparatus 
as shown in Figures 1 and 2. All remaining steps were 
performed at room temperature. Samples were pre- 

25 electrophoresed for a few minutes and allowed to relax before 
any data was collected. Pulsed fields were created with 
either a chrontrol time (Chrontrol Corp., San diego, CA) or 
an Adtron data generating board (Adtron Corp., Gilbert, AZ) 
housed in an IBM AT computer and powered by a Hewlett Packard 

30 6115A precision power supply. Field Strength was measured 
with auxiliary electrodes connected to a Fluke digital 
multimeter (J. Fluke Co., Everett, WA) . A Zeiss Axioplan 
microscope (Carl Zeiss, West Germany) equipped with 
epif luorescence optics suitable for DAPI fluorescence and a 

35 Zeiss lOOx Plan Neofluar oil immersion objective was used for 
visualizing samples. Excitation light was attenuated using 
neutral density filters to avoid photodamage to the 
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f luorescently labeled DNA. A C2400 silicon intensified 
target (SIT) camera (Hamamatsu Corp., Middlesex, NJ) was used 
in conjunction with an IC-1 image processing system 
(Inovision Corp., Research Triangle Park, NO to obtain and 
5 process video images from the microscope. Images were 
obtained continuously at the rate of one every five or six 
seconds, and as many as 200 digitized images could be stored 
per time course. Each digitized time-lapse image benefitted 
from the integration of 8 frames obtained at 3 0 Hz, which was 

10 fast enough to avoid streaking due to coil motion. After the 
time-lapse acquisition was complete, the microscope was 
brought out -of -focus and a background image was obtained. 
Each time-lapse image was processed by first attenuating a 
copy of the background image, so that the average background 

15 intensity was 82% of the average time- lapse image intensity. 
The attenuated background was subtracted from the timelapse 
image and the resultant image was then subjected to a linear - 
stretch contrast enhancement algorithm. Photographs of the 
processed images were obtained using a Polaroid Freeze Frame 

20 video image recorder (Polaroid Corp., Cambridge, ^4A) . 

EXAMPLE 3. Perturbing Molecules in a Gel 

The molecules of Example 2 were perturbed by POE. POE 
was accomplished by using a series of relatively short normal 

25 pulses of a chosen ratio and then after a longer time period, 
the polarity of one of the fields was switched. The switch 
time and normal field ratio are analogous to the pulsed 
electrophoresis variables of pulse time and field angle. 

The nomenclature used to describe a POE experiment is as 

30 follows: 3,5-80 second pulses, 3 volts/cM) . "3,5-80 

seconds" means a 3 second pulse south-north, followed by a 5 
second pulse east-west; after 80 seconds of this 3,5 second 
cycle, the polarity of the 5 second pulse is changed (west- 
east) for another 80 seconds, and a zig-zag staircase path is 

35 defined for the molecules involved. The pulse intensity was 
3 volts/cM. In this Example, epif luorescence microscopy was 
coupled with the POE method to enable the general study of 
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DNA conformational and positional changes during 
electrophoresis. While the POE method using the adapted 
microscopy chamber shown in Figure 2 was used in this 
experiment, ordinary electric fields switched on and off 
5 could have been used. POE offers certain advantages when 
electric fields are to be applied at different angles, as may 
be needed to rotate a molecule about its long axis. Figures 
1 and 2 show diagrams of the adapted POE chamber. 

10 EXAMPLE 4 . Observing and Measuring 

Molecular Relaxation in a Gel 

The relaxation of the G bacteriophage DNA of Examples 1- 

3 was observed after POE was conducted for 600 seconds (3,5- 

80 second pulses, 3 volts/cm) . 

The image processor is used to quantify and automate the 

imaging of the relaxation process, for example, through 

"feature analysis". Feature analysis works after successive 

images have been digitized and stored, as shown in Fig. 3(a). 

The image processor then identifies discrete objects in the 

20 i^iages, numbers them, and characterizes them according to 
shape. For example, the computer determines the effective 
ellipsoid axes (long and short) for a collection of distorted 
coils and calculate these features as a function of time as 
the coil approaches a spherical conformation during the 

25 relaxation process. Other types of computerized measurements 
also can be made to characterize the DNA. 

The images displayed in Fig. 5, obtained at 12 second 
intervals, show the relaxation of several molecules over a 96 
second time span. In (a), several coils are shown 3 seconds 

30 ^^^^^ applied field was turned off. The coils appear to 

relax through the same corrugated staircase path defined by 
the applied electrical pulses (see molecules marked by 
arrows) as determined by the limits of microscopic 
resolution. In (c) , a molecule is shown splitting into two, 

25 and by (j), all coils have relaxed to a round, unelongated 
conformation. The bar shown in (j) is 10 microns in length. 
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EXAMPLE 5. DetenQiaing the Molecular Weight of One or More 
Molecules by Measuring Relaxation Kinetics 

Molecules of known molecular weight are prepared for 

imaging according to the procedures of Examples 1-3, and' the 

g relaxation time of the molecules is determined by the methods 

of Examples 1-4. Relaxation time data is collected by 

imaging and is used to calculate a mathematical relationship 

between molecular weight and relaxation time of DNA molecules 

of similar composition. The relaxation time of a sample of 

molecules of unknown size is then measured, and the size of 

the molecules is calculated using the mathematical 

relationship determined on the basis of molecules of known 

size . 

25 EXAMPLE 6. 

Determining the Molecular Weight of 
One or More Molecules by 
Measuring Reorientation Rate in a Qel 

Polymers of any size, but particularly those that are 

too small to image (less than approximately 0.1 micron), are 

2Q sized in a matrix such as agarose or polyacrylamide gel by 
measuring the reorientation rate as induced by an applied 
electrical field. Although a reorientation measurements 
could be done in free solution, a matrix is preferred because 
it prevents unnecessary polymer convection and movement. 

25 Additionally the presence of a matrix may enhance the size 
sensitivity, partly because the orientation mechanism is 
different. POE is particularly useful for measuring 
reorientation time because of its experimental versatility 
and very high size resolution of perhaps 15 to 20 megabases . 
Stiff polymers such as DNA molecules (sized less than 150 
base pairs) exist in solution as rods and the rotational 
diffusion coefficient (the friction felt by the rod as you 
try to spin about its long axis) varies as M3 , Using 
microscopy, molecules which are large enough to be imaged are 

22 visualized, and their reorientation time is determined from 
the images. For any size of molecules, particularly those 
which are too small to visualize, the reorientation time of 
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each rod in the field of view is preferably measured by 
spectroscopic methods. Two such methods are described in 
detail below, namely fluorescence dichroism and 
birefringence : 

5 1) A chromophore that binds in a sterically predict- 

able way (ethidium bromide intercalates into DNA molecules) 
is attached to a polymer molecule. Polarized radiation is 
used to excite the chromophore. Measuring the total 
fluorescence intensity temporally provides orientation 
10 information of each molecule. The fluorescence radiation of 
each molecule in the microscope field is measured using a 
sensitive micro-channel plate detector. 

2) The orientational dynamics of a molecule is 
followed with birefringence measurements. Birefringence 
15 techniques measure the change of refractive index, which is 
easily correlated with the orientation of macroraolecules in 
solution or in a matrix. Birefringence measurements are 
taken while the DNA molecules are undergoing gel 
electrophoresis. When an electrical field is applied, the 
20 DNA molecules stretch out and align with the field, thereby 
changing the refractive index. By measuring the change of 
birefringence with time, it is possible to understand details 
of DNA blob train motion as the molecule orients with the 
applied electrical field. 
25 More specifically, birefringence measurements are made 

by determining the phase difference of two orthogonally 
polarized planes of laser radiation (red light) differing by 
a small frequency difference (supplied by the two frequency 
laser) . As the molecules align with the applied electrical 
30 field (in the POE chamber) , which is generated by pulse 
controller 82, the refractive index changes with molecular 
alignment. Light is detected by detector 76, and results. in 
a phase difference in the transmitted radiation, which is 
measured by the phase detector 78 (Fig. 3(b)) by comparing 
35 the value to a standard, sourced at laser 70. The phase 

difference data obtained as a function of time (the period of 
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field application) is digitized and stored on computer 80 for 
later retrieval and analysis. 

The instrument depicted in Figs. 1 and 2 applies the 
necessary fields to cause molecular reorientation. Many 
5 different rotational schemes can be described to optimally 
size molecules in the field. For example, the rotating field 
frequency can be swept to find resonant frequencies with the 
polymer sample. 

10 EXAMPLE 7 

Determining the Molecular Weight of 
One or More DNA Molecules by Measuring 
the Rotation Time of the Molecules In a Gel 

Molecules in the shape of rods or stiff coils are 

prepared and observed as in Examples 1-4, except that an 

15 acrylamide, rather than agarose gel optionally may be used. 

The rate of rotation of a coil or a rod is measured with 
a microscope-based system using any one of the techniques 
described above in Example 6 . Measurements are made of a 
sinusoidally varying signal as the molecule spins about its 

20 center. The sinusoidal signal is used to determine the 

polymer size or molecular weight by fitting the period of the 
sinusoidal signal to the rotational frictional coefficient, 
which varies as the cube power of the rod length. In other 
words, the measured angular velocity as measured from the 

25 sinusoidal signal (radians/sec.) varies as the rod length 
cubed in free solution (Boersma, S. (1960) J. Chem Phys . 32: 
1626-1631, 1632-1635) . 



The conditions for a proposed series of experimental 
runs, with constant t, are shown below. 
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Thus, in the first example, pairs, triplets or other 
sets of pulses of 5 volts/cm are successively applied for ,1 
millisecond in opposite directions, with the direction of the 
first of each successive set of pulses increasing by 10 
degrees in a clockwise direction away from the starting 
point . 

Molecules of known molecular weight are placed in a gel, 
and their rotation rate is determined when the above- 
described electric fields are applied. Rotation time data is 
collected and is used to calculate a mathematical 
relationship between molecular weight and rotation time of G 
bacteriophage DNA molecules in a particular gel. The 
rotation time of molecules of unknown size is then measured, 
preferably using a similar electric field, and the size of 
the molecules is calculated using the mathematical 
relationship determined on the basis of molecules of known 
size . 



EXAMPLE 8 
Determining the Molecular Weight of 
One or More Molecules by Measuring 
Curvilinear Length of DNA Molecules in a Gel 

The procedure of Examples 1-4 is followed for molecules 

of known molecular weight. Measurements of the curvilinear 

length of the molecules while they are in a perturbed state 

is collected by visualizing the molecules and is used to 

calculate a mathematical relationship between molecular 

weight and length. The curvilinear length of perturbed 

molecules of similar composition and unknown size is then 
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measured using the procedures of Examples 1-4, and the size 
of the molecules is calculated using the mathematical 
relationship determined on the basis of molecules of known 
size. Figs. 4 and 5 show perturbed molecules for which 
5 curvilinear length measurements can be made. 

EXAMPLE 9 
Determining the Molecular Weight of 
One or More Molecules by Measuring 
Diameter of DNA Molecules in a Gel 

10 The procedure of Examples 1-4 is followed for molecules 

of known molecular weight, except that measurements are made 
when the molecules are in a completely relaxed state. 
Measurements of the diameter or diameters of the 
substantially spherical or ellipsoidal G bacteriophage DNA 

15 molecules are collected and are used to calculate a 
mathematical relationship between molecular weight and 
diameter of G bacteriophage DNA molecules in the gel. The 
diameter of molecules of unknown size is then measured, and 
the size of the molecules is calculated using the 

20 mathematical relationship determined on the basis of 

molecules of known size. Figs. 4(a) and 5(j) show relaxed 
molecules for which diameter measurements can be made, 

EXAMPLE 10 

25 Preparing Large DNA Molecules for Imaging 

Chromosomal DNA molecules from Saccharomvces cerevisiae 
were prepared and isolated using the insert method and pulsed 
electrophoresis. Low gelling temperature agarose gel (FMC 
Corp. Rockland Maine) was used for preparation to permit 
relatively low temperature melting. Since UV radiation can 
break DNA molecules, desired bands were cut out of the gel, 
guided by ethidium stained flanking edge sections that were 
cut out of the gel, which were then photographed on a 301 nm 
transilluminator apparatus. The bands were then weighed and 
2^ equilibrated with a 10-fold excess of lOmM spermine in water 
for 3 hours at room temperature. Spermine requires a very 
low ionic strength environment to condense DNA and. 
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fortunately, the buffers used in electrophoresis are low 
ionic strength, thus eliminating the need for an 
equilibration step. The equilibrated samples were then 
melted in an oven at 74 for two hours and after melting. 
5 DAPI (1 microgram/ml) and 2-mercaptoethanol (1%) were added. 
3 microliters of the melted agarose/DNA mixture were 
carefully applied to a pre-heated microscope slide and a 
cover slip was placed on top before the mixture gelled. The 
slide was then viewed using a Zeiss Axioplan epif luorescence 

10 microscope fitted with a lOOX Plan Neofluar objective and 
showed small intensely bright balls which could be 
decondensed by the addition of salt, through the edges of the 
coverslip sandwich. 

As mentioned above, spermine is particularly useful in 

15 an environment of low ionic strength. On the other hand, if 
DNA molecules are placed in a highly ionic environment, the 
same type of condensation effect are accomplished with 
alcohol. Neither of these examples are to be construed as 
limiting the scope of the invention. 

20 

EXAMPLE 11 
Restriction mapping Schizosaccharomycas 
pombe Chromosomal DNA Molecules 

The DNA of Schizosaccharomyces pombe, a fungus with a 

genom'=» size of about 17-19 megabases distributed on three 

chromosomes 3, 6 and 8-10 megabases in size, is prepared for 

microscopy by condensation and uncollapsing, according to the 

method of Example 10. The 3-5 microliter agarose mixture 

contains approximately O.l nanograms of DNA, 0.5% b- 

mercaptoethanol , 1 microgram/ml DAPI, 100 micrograms/ml 

bovine serum albumin (acetylated; Bethesda Research 

Laboratories, Gaithersburg, MD) and 10-20 units of an 

appropriate restriction enzyme. This mixture is briefly held 

at 37°C and carefully deposited on a microscope slide and 

then topped with a coverslip. Prior to digestion with 

restriction enzymes the DNA is stretched by one of two ways: 

(1) the liquid slide/agarose/coverslip sandwich is optionally 
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sheared slightly by moving the coverslip or (2) an electrical 
field is applied using, for example, the POE instrument 
described in Fig. 3. A 10 mM magnesium chloride solution is 
then diffused into the sandwich once the gel has set. When 
5 the magnesium ions reach the DNA/enzyme complex, the enzyme 
cleaves the DNA molecule . 

The positions of the restriction cutting sites are 
determined by following the DNA strand from one end to the 
other using the microscope setup and noting cut sites. These 

10 sites appear as gaps in the strand, which is continuous 
before enzymatic digestion. The size of each of the 
fragments is then determined by the microscopic methods of 
this invention, including, (1) measuring the curvilinear 
length of each fragment, (2) allowing the fragments to relax 

15 and measuring their diameter, (3) perturbing the conformation 
of each fragment with an applied electrical field or flow 
field (as generated by moving solvent through a gel) and 
measuring the relaxation kinetics with direct visual 
detection of conformational and positional changes or 

20 microscopy combined with spectroscopy. Direct visual 
observation is preferred for larger molecules, while the 
other methods are well suited for fragments too small to 
image . 

The resulting sample when viewed using a fluorescence 
25 microscope shows a number of bright balls of three different 
sizes, with diameters varying as M.33, which is based upon 
the formula for the volume of a sphere, 4/3R3. The gel also 
contains a restriction enzyme which is active only when 
magnesium ions are present. 

30 

EXAMPLE 12 
In situ Hybridization of Nucleic 
Acid Probes to Single DNA Molecules 

Nucleic acids are prepared for microscopy as described 

in Examples 1-4 above. The agarose medium containing the 

nucleic acid molecules also contains labelled probes and a 

recombinational enzyme, recA, which mediates strand 
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displacement of the target molecule by the probe. Strand 
displacement and pairing occurs by D-looping (see Radding, 
C, Arm. Rev, Genet. 16:405-37 (1982)). ATP and magnesium ions 
are added to begin the reactions. These ingredients are 
5 diffused into the slide/gel/ coverslip sandwich as described 
in Example 11. The reaction is incubated at 37°c. Many 
different target molecules are simultaneously analyzed, using 
probes with different labels. 

Variations of the method of this invention other than 
10 those specifically described above are within the scope of 
the invention. For example, other parameters of the 
molecules can be measured, and various types of microscopes 
and spectroscopic equipment may be used. The pulsing 
routines for effecting molecule rotation can be varied. 
15 Combinations of the above -described techniques are also 

contemplated. For example, combinations of various types of 
external forces, mediums and spectroscopic techniques are 
within the scope of the invention. Furthermore, a measuring 
technique may be repeated several times, and the measurements 
20 from each trial may be averaged. 



25 



30 



35 



EXAMPLE 13 

Ordered Restriction Maps of Saccharoayces Cereviaime 
Chromosomes Constructed by optical mapping 

Optical mapping (e.g, as shown in figure 6), images are 
made stained, single, deproteinized DNA molecules during 
restriction enzyme digestion, allowing direct, ordered 
mapping of restriction sites. In brief, a flow field (or in 
principle, or other kinds of electrical field) is used to 
elongate DNA molecules dissolved in molten agarose and fix 
them in place during gelation. 

As a non-limiting example, yeast chromosomal DNA (yeast 
strain AB972) was resolved by pulsed electrophoresis 
(Schwartz et al . , Cell 37:67 (1984)) using 1.00% Seakem low 
melting agarose (FMC) , l/2x TBE(42.5mM Trizma base, 44.5mM 
boric acid, 1.25mM disodium EDTA) . Cut gel bands were 
repeatedly equilibrated in TE (lOmM Tris-Cl, ImM EDTA, 
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pHS.O). The gel embedded, purified chromosomes were then 
equilibrated overnight at 4°C in magnesium- free restriction 
buffer containing 0,1 mg/ml acetylated bovine serum albumin, 
1% )3-mercaptoethanol, 0.1% Triton X-100 (Boehringer Manheim, 
5 membrane quality), and 0.2 /ig/ml 4', 6-diamino-2 phenylindole 
dihydrochloride (DAPI) with slow shaking. Equilibrated 
samples ranging in volume from 50 to 100 ul were melted at 
72°C for 5 minutes, and then cooled to 31^C. Approximately 
0.3 - O.Sul of enzyme (2 to 14 units//xl) was spread on a 

10 slide. Enzyme reaction temperatures were as recommended by 
manufacturers. )8-mercaptoethanol was added to discourage 
photolysis M. Yanagida et al . in Applications of Fluorescence 
in the Biomedical Sciences, D.L, Taylor et al . , Eds. (Alan R. 
Liss, New York, 1986), pp. 321-345. and was tested at this 

15 concentration for any deleterious effects on digestion using 
electrophoresis. A 7/il volume of the melted sample was 
typically pipetted (slowly) using a wide bore pipette tip 
onto an 18X18 mm cover glass and rapidly deposited onto a 
slide. Timing and quenching of the gel is critical for 

20 controlling elongation. The reaction chamber was then sealed 
with mineral oil to avoid evaporation, and the agarose was 
allowed to gel for at least 30 minutes at 4*'C, prior to 
diffusion of 50mM MgClj through an open space. For chromosome 
I(240kb) and III (345kb) , slides were in a cold desiccator 

25 (4*>C) prior to casting to hasten gelling avoiding premature 
molecular relaxation. For the larger chromosomes, which 
relax more slowly, slides were kept at room temperature. The 
slide was placed on a temperature controlled microscope stage 
at 37°C (except Cspl, 30°C) . The gelatin process restrains 

30 elongated molecules from appreciably relaxing to a random 
coil conformation during enzymatic cleavage. A restriction 
enzyme is added to the molten agarose-DNA mixture and cutting 
is triggered by magnesium ions diffused into the gelled 
mixture (mounted on a microscope slide) . Cleavage sites are 

35 visualized as growing gaps in imaged molecules. DNA 

molecules were imaged using a Zeiss Axioplan or Axiovert 135 
microscope equipped for epi - fluorescence (487901 filter pack 
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for UV excitation and Blue emission) and a lOOX or 63X Plan- 
Neofluar objective (Zeiss) coupled to Hammamatsu C2400 SIT 
cameras. Care was taken to adjust the camera controls to 
avoid saturating the digitizer at either end of the intensity 
5 range. Every 20 seconds, 32 video frames were digitized to 8 
bits and integrated to give 13 bit precision by a Macintosh 
based Biovision image processor or a Pixel pipeline digitizer 
(Perceptics Corp.). A computer controlled shutter was used 
to limit illumination to 1 . 5 seconds per image giving a total 

10 of about 135 to 255 seconds for typical experiments. Neutral 
density filters were used to keep the illumination intensity 
below 100 /iW measured at the objective. Control experiments 
showed no damage to DNA molecules under these conditions. 
Digitized images were recorded directly to disk and archived 

15 on tape. The resulting fragments are sized in two ways: by 
measuring the relative fluorescence intensities of the 
products, and by measuring the relative apparent DNA 
molecular lengths in the fixating gel. Maps are subsequently 
constructed by simply recording the order of the sized 

20 fragments. Length and relative fluorescence intensity were 
calculated to l€-bit precision using a modified version of 
NIH Image for Macintosh by Wayne Rasband, available upon 
request from the authors (e-mail huff® mcclbO .med.nyu.edu) . 
Briefly, the original unprocessed image was displayed in an 

25 enlarged format and an overlay image was prepared by manually 
tracing the DNA. The length map was made directly from this 
overly. For intensity calculations, the 13 -bit raw data 
image was smoothed and the overlay image was dilated five 
times to cover all foreground pixels. For each pixel marked 

30 on the overlay, a synthetic background value was calculated 
as the weighted average of surrounding pixels, with a weight 
that decreased with distance, but was zero for all marked 
pixels. These values are intended to approximate those which 
would have been measured had the DNA been absent. The 

35 intensity of a particular DNA fragment was the sum of all 
pixels of the fragment minus the matching background pixels. 
The are of the fragment was the original overlay dilated 
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twice. This process was repeated for each frame of raw data 
which had an overlay image, excluding those with poor focus. 
Intensity results were averaged for five images following a 
cut, and the relative sizes of the two fragments were 
5 calculated as xA(x+y) and y/ (x+y) . If fragment y later cuts 
into u and v, then (y/ (x+y) ) (u(u+v)) is used for the size of 
u. The resulting numbers constitute a single sample for the 
purposes of subsequent analysis. Averaging a small number of 
molecules rather than utilizing only one improves accuracy 

10 and permits rejection of unwanted molecules. The samples 

were averaged and the 90% confidence interval on the mean was 
calculated using the t distribution with n-1 d.f . and the 
sample standard deviation. This calculation is valid if the 
data represent random samples from a normal distribution. 

15 There is a 90% chance that the population mean falls within 
the confidence interval. For chromosome I, the reported 
confidence interval was found by taking the lower bound from 
the short fragments and upper bound from the long fragments. 
The 90% confidence interval for the population standard 

20 deviation was calculated using the sample standard deviation, 
the number of samples, and the chi-square distribution with 
n-l d.f. The midpoint of this interval was used to estimate 
the population standard deviation. The coefficient of 
variation (CV) is the estimated population standard deviation 

25 divided by the sample mean. The pooled standard deviation is 
the square root of the average of the variances. The 
relative error is the differences between value obtained 
hrein and the reported value divided by the reported value. 
Optical map production is very rapid because of the 

30 combination of restriction fragment ordering in real time 
with fast accurate sizing techniques. Optical mapping is a 
powerful new technology for rapidly creating ordered 
restriction maps of eucaryotic chromosomes or YACs, without 
the need for analytical electrophoresis, cloned libraries, 

35 probes, or PGR primers. Incremental technical improvements 
should enable the rapid high resolution mapping of mammalian 
chromosomes and ordering of YACs. 
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Gel fixation and mechanics of DNA relation under tension 
and cleavage: A single large DNA molecule 200 fim long (600 
kb) is a random coil in solution which can be visualized as a 
loosely packed ball averaging 8 across (Roberts, T.M. et 
5 al., 1975, CRC Crit . Rev. Biochem. 3:349). Optical mapping 
begins with stretching out such a DNA molecule and fixing it 
in place to inhibit rapid relation, prior to imaging by light 
microscopy. The fixed molecule must lie within a shallow 
plane of focus for successful imaging. Elongated molecules 

10 in a gel behave mechanically like a stretched spring 
(Schwartz, D.C. & Koval, M., 1989, Nature 138:520-522): 
fixed molecules are under tension which is released during 
coil relaxation to a random conformation. However, excess 
fixation is undesirable for optical mapping, since 

15 restriction cleavage sites must relax to be detected and 
imaged as growing gaps . 

DNA molecules embedded in agarose gel, during 
electrophoresis, have previously been modeled, by Zimm (Zimm, 
B,H., 1991, J. Chem. Phys . 94=2187-2206) as a series of 

20 connected pools of coil, segments under tension with each 
other, and it was calculated that the force (fi) associated 
with the free energy change of shuttling coil segments 
between pools is given by 

f i=3kT/ (2nib) ( (a2/nib2) -1) + (kT/b) InC, where k is the 

25 Boltzmann constant, a is the gel pore diameter, ni is the 
number of associated coil segments, b is the coil segment 
length, T is the temperature and C is a constant relating to 
coil segment structure. This result shows that the tension 
developed between pools is inversely related to the number of 

30 segments contained with a pore volume (Eq.l). It follows 
that a stretched out, elongated molecule is under more 
tension than a compact, relaxed one. 

Large DNA molecules can be stretched out in molten 
agarose by flow forces and then rapidly fixed in place by 

35 agarose gelation, without application of electrical fields. 
Yeast chromosomal DNA (yeast strain AB972) was resolved by 
pulsed electrophoresis (D. C. Schwartz and C.R. Cantor, Cell 
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37,67 (1984)) using 1.00% Seakem low melting agarose (FMC) , 
l/2x TBE(42.5mM Trizma base, 44.5mM boric acid, 1.25mM 
disodium EDTA) . Cut gel bands were repeatedly equilibrated 
in TE (lOmM Tris-Cl, ImM EDTA, pH8,0). The gel embedded, 
5 purified chromosomes were then equilibrated overnight at 4«C 
in magnesium- free restriction buffer containing 0.1 mg/ml 
acetylated bovine serum albumin, i% ^-mercaptoethanol , 0.1% 
Triton X-100 (Boehringer Manheim, membrane quality), and 0.2 
Mg/ml 4', 6-diamino-2 phenylindole dihydrochloride (DAPI) 

10 with slow shaking. Equilibrated samples ranging in volume 
from 50 to 100 ul were melted at 72«C for 5 minutes, and then 
cooled to 370c. Approximately 0.3 - 0.5ul of enzyme (2 to 14 
units/Ml) was spread on a slide. Enzyme reaction 
temperatures were as recommended by manufacturers. /8- 

15 mercaptoethanol was added to discourage photolysis M. 
Yanagida et al . in Applications of Fluorescence in the 
Biomedical Sciences, D.L. Taylor et ai . , Eds. (Alan R. Liss, 
New York, 1986), pp. 321-345. and was tested at this 
concentration for any deleterious effects on digestion using 

20 electrophoresis. A 7/il volume of the melted sample was 
typically pipetted (slowly) using a wide bore pipette tip 
onto an 18X18 mm cover glass and rapidly deposited onto a 
slide. Timing and quenching of the gel is critical for 
controlling elongation. The reaction chamber was then sealed 

25 with mineral oil to avoid evaporation, and the agarose was 
allowed to gel for at least 30 minutes at 4«'C, prior to 
diffusion of 50mM MgC12 through an open space. For 
chromosome I{240kb) and III (345kb) , slides were in a cold 
desiccator (4°) prior to casting to hasten gelling avoiding 

30 premature molecular relaxation. For the larger chromosomes, 
which relax more slowly, slides were kept at room 
temperature. The slide was placed on a temperature 
controlled microscope stage at 37°c (except Cspl, 30°C) . 
Experimentally, the kinetics of gelation are controlled by 

35 temperature, and optimization of the annealing conditions. 
For this analysis, DNA coils must be critically stretched: 
too much and molecule becomes difficult to image; too little, 
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and there is insufficient tension to reveal cut sites. Yeast 
chromosomal DNA (yeast strain AB972) was resolved by pulsed 
electrophoresis (D. C. Schwartz and C.R. Cantor, Cell 37,67 
(1984)) using 1.00% Seakem low melting agarose (FMC) , l/2x 
5 TBE(42.5mM Trizma base, 44,5mM boric acid, 1.25mM disodium 
EDTA) . Cut gel bands were repeatedly equilibrated in TE 
(lOmM Tris-Cl, ImM EDTA, pH8.0). The gel embedded, purified 
chromosomes were then equilibrated overnight at 4*>C in 
magnesium- free restriction buffer containing 0.1 mg/ml 

10 acetylated bovine serum albumin, 1% /3-mercaptoethanol, 0.1% 
Triton X-100 (Boehringer Manheim, membrane quality), and 0.2 
ug/ml 4', 6-diamino-2 phenylindole dihydrochloride (DAPI) 
with slow shaking. Equilibrated samples ranging in volume 
from 50 to 100 ul were melted at 72**C for 5 minutes, and then 

15 cooled to 370c. Approximately 0.3 - 0 . 5ul of enzyme (2 to 14 
units//il) was spread on a slide. Enzyme reaction 
temperatures were as recommended by manufacturers. ^- 
mercaptoethanol was added to discourage photolysis M. 
Yanagida et al . in Applications of Fluorescence in the 

20 Biomedical Sciences, D.L. Taylor et al . , Eds. (Alan R. Liss, 
New York, 1986), pp. 321-345. and was tested at this 
concentration for any deleterious effects on digestion using 
electrophoresis. A 7^1 volume of the melted sample was 
typically pipetted (slowly) using a wide bore pipette tip 

25 onto an 18X16 mm cover glass and rapidly deposited onto a 
slide. Timing and quenching of the gel is critical for 
controlling elongation. The reaction chamber was then sealed 
with mineral oil to avoid evaporation, and the agarose was 
allowed to gel for at least 30 minutes at 4°C, prior to 

30 diffusion of 50mM MgC12 through an open space. For 

chromosome I(240kb) and III (345kb) , slides were in a cold 
desiccator (4°) prior to casting to hasten gelling avoiding 
premature molecular relaxation. For the larger chromosomes, 
which relax more slowly, slides were kept at room 

35 temperature. The slide was placed on a temperature 

controlled microscope stage at 37<^C (except Cspl, 30*^0. 
Excessively stretched molecules present too little 
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fluorochrome per imaging pixel, so that measured molecular 
intensities approach background values. Additionally, the 
fixation process has to be gentle enough to permit some coil 
slippage to reveal cut sites. Taking these and other 
5 considerations into account, the present fixation conditions 
were optimized to produce molecules spanning approximately 
20% of their curvilinear contour lengths. 

How DNA molecules are entrapped by agarose gelation is 
not known. Imaged, stretched molecules show bright round 
10 pools of coil at their ends, evidence of chain relaxation 
(Figs. 8, 9) . The pool sizes range from 1-3/ira. Segmental 
pools are also observed to form internally, and then 
disappear, as local pockets of coil tension equilibrate with 
each other. As a DNA molecule relaxes within the train of 
15 contiguous gel pores it spans, the segmental density 

increases, and segments can even be seen to spill over into 
neighboring pore spaces. The detailed relaxation mechanism 
is a complex one (de Gennes, et al . , Scaling Concepts in 
Polymer Physics, Cornell University Press, 1979) . Gaps 
20 appear because a molecule experiences an effective tension 
since the conf igurational entropy of the elongated polymer 
is lower than that of the relaxed state. On a simple 
descriptive level, the process can be compared to watching 
the relaxation of a stretched-out thick rubber band encased 
25 in a tight tube, with holes in the sides. Cleavage 
accelerates relaxation by creating new ends within a 
molecule, and possibly also by causing a mechanical 
perturbation that releases trapped fragments from local 
energy minima. 

30 A high numerical aperture microscope objective can 

produce bright, high contrast images of stained DNA 
molecules, but with a very shallow depth of focus. 
Experimentally, for a long molecules to be in focus, it must 
lie within a plane approximately Q.2fim thick. Our method of 

35 gel fixation reproducibly allows visualization of molecules 
that are within this 0.2 micron tolerance as measured 
optically. This remarkable degree of optical flatness 
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results from a laminar, parabolic fluid flow pattern 
generated between the glass surfaces, prior to gelation. 
Furthermore, dissolved agarose and DNA molecules may 
potentiate this effect by facilitating laminar flow, while 
5 preventing onset of turbulence. 

Finally, gel fixation of large DNA molecules is 
convenient enough to be broadly applicable to other systems, 
especially when biochemical reactions can be coupled to 
visualizable events. 
10 Restriction Digestion of Single Molecules. Optical 

mapping detects restriction enzyme cleavage sites as gaps 
that appear in a fixed molecule as fragments relax to a more 
random conformation (Figs. 13,15). Since the rates of 
enzymatic cleavage by different restriction enzymes are 
15 variable (Wells, et al., Genetics 127,681, 1981), careful 

adjustment of the timing is critical. Cleavage should occur 
only after molecular fixation is complete because premature 
reactions disrupt attempts to phase fragments. This timing 
problem was solved by premixing the agarose-DNA solution with 
20 restriction enzyme, at 37oc, and triggering the reaction by 
diffusing magnesium ions into the viewing field, without 
disturbing the gel. Yeast chromosomal DNA (yeast strain 
AB972) was resolved by pulsed electrophoresis (D. C. Schwartz 
and C.R. Cantor, Cell 37,67 (1984)) using 1.00% Seakem low 
25 melting agarose (FMC) , l/2x TBE(42.5mM Trizma base, 44.5mM 
boric acid, 1.25mM disodium EDTA) . Cut gel bands were 
repeatedly equilibrated in TE (lOmM Tris-Cl, ImM EDTA, 
pHS.O) . The gel embedded, purified chromosomes were then 
equilibrated overnight at 4°C in magnesium- free restriction 
30 buffer containing 0.1 mg/ml acetylated bovine serum albumin, 
1% 0-mercaptoethanol, 0,1% Triton X-lOO (Boehringer Manheim, 
membrane quality), and 0.2 ug/ml 4', 6-diamino-2 phenylindole 
dihydrochloride (DAPI) with slow shaking. Equilibrated 
samples ranging in volume from 50 to 100 ul were melted at 
35 72<'C for 5 minutes, and then cooled to 37oc. Approximately 
0.3 - 0.5ul of enzyme (2 to 14 units/^l) was spread on a 
slide. Enzyme reaction temperatures were as recommended by 
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manufacturers. /3-mercaptoethanol was added to discourage 
photolysis M. Yanagida et al . in Applications of Fluorescence 
in the Biomedical Sciences, D.L. Taylor et al . , Eds. (Alan R. 
Liss, New York, 1986), pp. 321-345. and was tested at this 
5 concentration for any deleterious effects on digestion using 
electrophoresis. A 7/xl volume of the melted sample was 
typically pipetted (slowly) using a wide bore pipette tip 
onto an 18X18 mm cover glass and rapidly deposited onto a 
slide. Timing and quenching of the gel is critical for 

10 controlling elongation. The reaction chamber was then sealed 
with mineral oil to avoid evaporation, and the agarose was 
allowed to gel for at least 30 minutes at 4*^0, prior to 
diffusion of 50mM MgCl2 through an open space. For 
chromosome I (240kb) and III (345kb) , slides were in a cold 

15 desiccator (4*^0 prior to casting to hasten gelling avoiding 
premature molecular relaxation. For the larger chromosomes, 
which relax more slowly, slides were kept at room 
temperature. The slide was placed on a temperature 
controlled microscope stage at 37**C (except Cspl, 30*>C) . 

20 Aside from gaps, cleavage is also signaled by the appearance 
of bright condensed pools or "balls" of DNA on the fragment 
ends at the cut site. These balls form shortly after 
cleavage and result from coil relaxation which is favored at 
ends (Figs. 13, 15). This pooling of segments is useful in 

25 map rnaking because it helps to differentiate out-of-focus 
segments, that might appear as gaps, from actual cuts. 
Cleavage is scored more reliably by both the appearance of 
growing gaps and enlarging bright pools of segments at the 
cut site. 

30 Map Construction - Fragment Number Determination. Large 

scale restriction maps have been constructed primarily from 
electrophoretically derived data. A new set of approaches 
has been developed to size and order fragments on samples 
that can consist of single DNA molecules, using microscope 

35 based techniques. The first step is to determine the number 
of cleavage sites within a molecule. The cut sites within a 
molecule tend to appear at irregular times after Mg^* 
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addition. All possible cleavage sites do not appear 
simultaneously; instead, cuts usually appear within 5 minutes 
of each other, under the conditions described here. The 
extent of digestion depends on a number of factors including 
5 both the fragment number and size. Digestion results 
obtained by optical mapping for a selected set of Not I 
digested yeast chromosomes are displayed in Fig. 7. 
Fortunately, published Not I restriction enzyme maps are 
available for all S. cerevisiae chromosomes (Link, A. J. , 

10 1991, Genetics 12J681-698) , enabling reliable benchmarking of 
the optical mapping methodology. 

A typical mounted sample contains approximately 3-5 
molecules within a single viewing field and overall, roughly 
50-95% of them show evidence of one or more cuts by the 

15 criteria described here. The histograms in Fig. 7 show that 
the overall number of cut sites exceeding published results 
is quite low. The cutting frequency results (Fig. 7B) for 
chromosome V digested with Not I show that the number of 
fully cut molecules is approximately half that of all singly 

20 cut molecules: the value corresponding to complete digestion 
is caculated by assuming that an equal distribution of 
identically sized chromosome V and VIII DNA molecules are 
present in the mounted sample. The Not I restriction maps 
for these chromosomes reveal that chromosome V has 3 cut 

25 sites, while VIII has only 2. Chromosome XI cutting 
frequency data (Fig. 7C) is different; 25% of all cut 
molecules are seen to be fully digested (two cutting sites) . 
An explanation for the apparently lower frequency is that 
this chromosome produces a 30 kb sized Not I fragment that is 

30 more difficult to detect optically than larger fragments. 

This result is not surprising considering that tension across 
a cut is probably fragment size dependent, so that smaller, 
elongated fragments apply less tension. Furthermore, since 
coil tension across a cut site is required for its 

35 identification, additional cuts will produce fragments that 
ultimately relax to reduce the overall molecular tension and 
impede the observation of further cuts. Finally, very large, 
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1 megabase sized molecules have been spread, such as 
chromosome XIII and XVI, and these data (Pig. 7D) show that 
roughly half of the molecules are digested to completion (one 
cut) in mounts with observable cutting activity. 
5 The maximum number of cuts determined by histogram 

analysis is the bin containing the largest number of cut 
sites whose molecules can be properly averaged by intensity 
and length measurements for size. 

Influence of coil relaxation on detection of cuts. 

10 Aside from cases involving small fragments, incomplete 
digestion is seen in all the histograms in Fig. 7. While 
potential cases range from photo irradiation artifacts to 
interactions imposed by the current design of the microscope 
chamber, partial digestion observed here is attributable 

15 mostly to incomplete coil relaxation at a given cut site, due 
to relaxation modes that fail to produce a gap or distinct 
ball. A variety of different relaxation modes are observed 
in actual practice, some of which are sketched in Fig. 8. 
Relaxation modes can both facilitate (8D) and hinder cut 

20 detection (8H) . Application of electric or flow fields might 
be used to trigger relaxation at such sites and permit their 
detection. Parallel electrophoresis experiments show 
essentially complete digestion under similar experimental 
conditions . 

25 Interestingly, the data for chromosome I show almost 

complete digestion (95%; see Fig. 7A) . images of chromosome 
I under digestion (Fig. 13A) reveal that after the expected 
single cut is observed, only the cut site ends relax and 
bright pools of segments accumulate at the ends (20 

30 molecules), as interpreted in Fig. 8B, 8C and 8D, while the 
remaining ends appear to be fixed in place. Bright pools of 
relaxed coil segments accumulate at the ends of gel -fixed DNA 
molecules, as noted above. 

Conceivably, the ends of chromosome I embedded in 

35 agarose are behaving as a sort of molecular rivet (Fig. 9) , 
reacting to the tension developed between it and the 
intervening molecular segments to provide ideal mechanical 
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conditions for cut detection. It seems likely that short- 
range interactions will predominate so that the amount of 
relaxed coil present at the ends of elongated molecules will 
not vary much with molecular mass above some threshold in 
5 size. Consequently, a relatively short molecule, such as 
chromosome I, will contain a greater proportion of relaxed 
coil segments at its end than longer ones, such as 
chromosomes XIII and XVI. 

Fragment Sizing By Relative Intensity. The second step 
10 is to size the resulting restriction fragments. For this 
purpose two complementary approaches can be used, one based 
on relative fragment fluorescence intensity and the second on 
apparent relative length measurements. However, neither 
approach provides absolute values, but each can be 
15 standardized readily. Fortunately, the gel fixation 

technique described above produces a natural substrate for 
intensity measurements since an entire molecule can be 
brought into focus. Gel fixation is able to flatten 
molecules spanning as much as 250 /im. Segments of molecules 
20 that are out of focus cannot be used for intensity 

measurements because their intensities are not proportional 
to mass in any simple way. A relevant observation here is 
that when an elongated molecules substantially relaxes, most 
of its mass moves out of focus, as expected, since the 
25 hydrodynamic diameter of a fully relaxed 700 kb DNA molecule 
in fluid is 8 ^m while the depth of focus used for imaging 
molecules under the microscope is approximately 0.2 fim. 

The absolute fluorescence intensity of a DNA fragment in 
the microscope is determined by many variables, such as the 
30 camera gain control and lamp brightness, and therefore is not 
a desirable quantity to measure. By calculating the relative 
intensity of two fragments (from the same parental molecule), 
one of the fragments can serve as an internal intensity 
reference for the other. Relative intensities are converted 
35 to kb by multiplying by the know or independently determined 
chromosome size. Length and relative fluorescence intensity 
were calculated to 16 -bit precision using a modified version 
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of NIH Image for Macintosh by Wayne Rasband, available upon 
request from the authors (e-mail huff ® mcclbO , med.nyu.edu ) . 
Briefly, the original unprocessed image was displayed in an 
enlarged format and an overlay image was prepared by manually 
5 tracing the DNA. The length map was made directly from this 
overlay. For intensity calculations, the 13 -bit raw data 
image was smoothed and the overlay image was dilated five 
times to cover all foreground pixels. For each pixel marked 
on the overlay, a synthetic background value was calculated 

10 as the weighted average of surrounding pixels, with a weight 
that decreased with distance, but was zero for all marked 
pixels. These values are intended to approximate those which 
would have been measured had the DNA been absent. The 
intensity of a particular DNA fragment was the sum of all 

15 pixels of the fragment minus the matching background pixels. 
The area of the fragment was the original overlay dilated 
twice. This process was repeated for each frame of raw data 
which had an overlay image, excluding those with poor focus. 
Intensity results were averaged for five images following a 

20 cut, and the relative sizes of the two fragments were 

calculated as x/ (x+y) and y/ (x+y) . If fragment y later cuts 
into u and v, then (y/ (x+y) ) (u/ (u+v) ) is used for the size of 
u. The resulting numbers constitute a single sample for the 
purposes of subsequent analysis. The optical contour 

25 maximization technique can be used to size samples containing 
a small number of molecules (Guo, Nature 359,783, 1992). 
Fig. lOA shows intensity values for a series of yeast 
chromosome Not I restriction fragments measured optically and 
plotted against published values derived from electrophoresis 

30 based measurements (Link, Genetics, 127, 681, 1991) . Points 
close to the diagonal line are in good agreement. 
Disregarding the chromosome V and VIII results, which were 
based on low precision (8-bit) intensity data, and 
disregarding the two short fragments less than 60kb, the 

35 pooled standard deviation is 36kb (Fig. lOA inset) and the 
average of the coefficients of variation is 16%, comparable 
to routine pulsed electrophoresis size determinations. The 



- 124 - 



wo 96/31522 



PCT/US96/(M550 



correlation with published results is excellent: the average 
of the relative errors is 5% whereas the published errors 
average 4% (Link, Genetics, 127, 681, 1991). The samples 
were averaged and the 90% confidence interval on the mean was 
5 calculated using the t distribution with n-1 d.f . and the 
sample standard deviation. This calculation is valid if the 
data represent random samples from a normal distribution. 
There is a 90% chance that the population mean falls within 
the confidence interval. For chromosome I, the reported 

10 confidence interval was found by taking the lower bound from 
the short fragments and the upper bound from the long 
fragments. The 90% confidence interval for the population 
standard deviation {Fig. 10 inset graphs) was calculated 
using the sample standard deviation, the number of samples, 

15 and the chi-square distribution with n-1 d.f. The midpoint 
of this interval was used to estimate the population standard 
deviation. The coefficient of variation (CV) is the 
estimated population standard deviation divided by the sample 
mean. The pooled standard deviation is the square root of 

20 the average of the variances. The relative error is the 

differences between our value and the reported value divided 
by the reported value. Due in part to the intensity 
normalization procedure, the precision becomes lower for very 
small fragments, and size agreement is poor for the 30 and 55 

25 kb measurements. Fluorescence intensity measurements size 
these fragments at almost twice the established values as 
described below. Changes in the algorithm for correcting the 
backgrounds of these measurements and the data collection 
process should improve the precision significantly, 

30 One test of the validity of relative fluorescence 

intensity measurements is to monitor the constancy of 
fragment intensities over a usable range of molecular 
relaxation conditions. This requirement is most critically 
tested when restriction fragments differ greatly in size. 

35 Fig. 11 shows the results of absolute intensities versus 

molecular length measurements for three typical sizes. These 
results show that intensities remain relatively constant over 
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a wide size range despite a 3-4 fold change in measured 
molecular length. This beneficial effect is attributed in 
part to the mild fixation conditions, so that Brownian motion 
can dither the elongated coil along the z-axis; this motion 
5 is clearly observed on the live video monitor as digestion 
proceeds. By averaging frames over a 1 second interval most 
of the DNA is observed as it moves through the focal plane 
and within the gel pores. 

Fragment Sizing by Relative Apparent Lengths. The 

10 physical basis of apparent length measurement is simple: each 
gel -embedded restriction fragment is assumed to have equal 
coil density, on the average. That is, each fragment has the 
same change to be stretched more or less, so a length average 
created over a number of mounts provides a good measure of 

15 relative size. Again, relative apparent lengths are 
converted to kb by multiplying by the chromosome size. 
Length and relative fluorescence intensity were calculated to 
16 -bit precision using a modified version of NIH Image for 
Macintosh by Wayne Rasband, available upon request from the 

20 authors (e-mail huff ® mcclbO.med.nyu.edu). Briefly, the 
original unprocessed image was displayed in an enlarged 
format and an overlay image was prepared by manually tracing 
the DNA. The length map was made directly from this overlay. 
For intensity calculations, the 13 -bit raw data image was 

25 smoothed and the overlay image was dilated five times to 
cover all foreground pixels. For each pixel marked on the 
overlay, a synthetic background value was calculated as the 
weighted average of surrounding pixels, with a weight that 
decreased with distance, but was zero for all marked pixels. 

30 These values are intended to approximate those which would 
have been measured had the DNA been absent. The intensity of 
a particular DNA fragment was the sum of all pixels of the 
fragment minus the matching background pixels. The area of 
the fragment was the original overlay dilated twice. This 

35 process was repeated for each frame of raw data which had an 
overlay image, excluding those with poor focus. Intensity 
results were averaged for five images following a cut, and 
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the relative sizes of the two fragments were calculated as 
x/(x+y) and y/{x+y) , If fragment y later cuts into u and v, 
then (y/ (x+y) ) (u/ (u+v) ) is used for the size of u. The 
resulting numbers constitute a single sample for the purposes 
5 of subsequent analysis. Then, the apparent lengths of 

restriction fragments are converted, obtaining good accuracy 
from as few as 4 molecules. The samples were averaged and 
the 90% confidence interval on the mean was calculated using 
the t distribution with n-1 d.f . and the sample standard 

10 deviation. This calculation is valid if the data represent 
random samples from a normal distribution. There is a 90% 
chance that the population mean falls within the confidence 
interval. For chromosome I, the reported confidence interval 
was found by taking the lower bound from the short fragments 

15 and the upper bound from the long fragments. The 90% 

confidence interval for the population standard deviation 
(Fig. 10 inset graphs) was calculated using the sample 
standard deviation, the number of samples, and the chi- square 
distribution with n-1 d.f. The midpoint of this interval was 

20 used to estimate the population standard deviation. The 
coefficient of variation (CV) is the estimated population 
standard deviation divided by the sample mean. The pooled 
standard deviation is the square root of the average of the 
variances. The relative error is the differences between our 

25 value and the reported value divided by the reported value. 
Relative determinations of apparent length were verified 
against the same set of restriction fragments as in the 
fluorescence intensity measurements, and these results (Fig. 
lOB) show a similar average relative error of 16% (excluding 

30 the 3 0 and 90kb fragments) . The pooled standard deviation 
was 47kb (Fig lOB inset) , the average of the coefficients of 
variation was 29%. 

Apparent molecular length measurements are more robust 
than intensity measurements, but are less precise, and 

35 consequently require additional measurements to achieve an 
equivalent degree of accuracy. But good length measurements 
can be obtained from slightly out-of-focus fragments, whereas 
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blurry, out of focus images will confound intensity based 
measurements. Size determination of small fragments by 
length were better than intensity. The 30kb fragment was 
sized at 44kb by length vs. 70kb by intensity, and the 55kb 
5 fragment was sized at 49kb vs. 88kb. Given the limited 

sample number inherent to optical mapping, having two sizing 
methods for cross-checking results is extremely important for 
successful map making. 

Map Construction Based on Length and Intensity 

10 Measurements. Fig. 12 illustrates three types of ordered 
restriction maps produced by optical mapping compared with 
(Link, Genetics 127, 681, 1991) . The bars shown correspond 
to sizing analysis results of the Not I restriction fragment 
as plotted in Fig. 10. Fig. 13 shows selected processed 

15 fluorescence micrographs of different yeast chromosomal DNA 
molecules digested with Not I. Yeast chromosomal DNA (yeast 
strain AB972) was resolved by pulsed electrophoresis (D. C. 
Schwartz and C.R. Cantor, Cell 37:67 (1984)) using 1.00% 
Seakem low melting agarose (FMC) , l/2x TBE(42.5mM Trizma 

20 base, 44.5mM boric acid, 1.25mM disodium EDTA) . Cut gel 
bands were repeatedly equilibrated in TE (lOmM Tris-Cl, ImM 
EDTA, pH8.0). The gel embedded, purified chromosomes were 
then equilibrated overnight at 4°C in magnesium -free 
restriction buffer containing 0.1 mg/ml acetylated bovine 

25 serum albumin, 1% )3-mercaptoethanol , 0.1% Triton X-100 

(Boehringer Manheim, membrane quality), and 0.2 ug/ml 4', 6- 
diamino-2 phenylindole dihydrochloride (DAPI) with slow 
shaking. Equilibrated samples ranging in volume from 50 to 
100 ul were melted at 72 ®C for 5 minutes, and then cooled to 

30 37''C. Approximately 0.3 - 0.5ul of enzyme (2 to 14 units/^l) 
was spread on a slide. Enzyme reaction temperatures were as 
recommended by manufacturers. )S-mercaptoethanol was added to 
discourage photolysis (M. Yanagida et al . in Applications of 
Fluorescence in the Biomedical Sciences, D.L. Taylor et al . , 

35 Eds. (Alan R. Liss, New York, 1986), pp. 321-345.) and was 
tested at this concentration for any deleterious effects on 
digestion using electrophoresis. A 7/il volume of the melted 
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sample was typically pipetted (slowly) using a wide bore 
pipette tip onto an 18X18 mm cover glass and rapidly 
deposited onto a slide. Timing and quenching of the gel is 
critical for controlling elongation. The reaction chamber 
5 was then sealed with mineral oil to avoid evaporation, and 
the agarose was allowed to gel for at least 30 minutes at 
4°C, prior to diffusion of 50mM MgCl2 through an open space. 
For chromosome I(240kb) and III (345kb) , slides were in a 
cold desiccator (4°) prior to casting to hasten gelling 

10 avoiding premature molecular relaxation. For the larger 
chromosomes, which relax more slowly, slides were kept at 
room temperature. The slide was placed on a temperature 
controlled microscope stage at 37**C (except Cspl, 30°C) . 
These images clearly show progressive digestion by the 

15 appearance of growing gaps in the fixed molecules. From such 
data fragment, order was determined from inspection of time- 
lapse images obtained every 2 0 seconds. DNA molecules were 
imaged using a Zeiss Axioplan or Axiovert 135 microscope 
equipped for epi- fluorescence (487901 filter pack for UV 

20 excitation and Blue emission) and a lOOX or 63X Plan-Neof luar 
objective (Zeiss) coupled to Hammamatsu C2400 SIT cameras. 
Care was taken to adjust the camera controls to avoid 
saturating the digitizer at either end of the intensity 
range. Every 20 seconds, 32 video frames were digitized to 8 

25 bits and integrated to give 13 bit precision by a Macintosh 
based Biovision image processor or a Pixel pipeline digitizer 
(Perceptics Corp.). A computer controlled shutter was used 
to limit illumination to 1 . 5 seconds per image giving a total 
of about 135 to 255 seconds for typical experiments. Neutral 

30 density filters were used to keep the illumination intensity 
below 100 measured at the objective. Control experiments 
showed no damage to DNA molecules under these conditions. 
Digitized images were recorded directly to disk and archived 
on tape. Since observed molecules tend to move and can 

35 sometimes be confused with other molecules, inspection of a 
"cutting sequence" or "cutting movie" simplifies 
deconvolution of molecule-molecule interactions. Agreement 
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is excellent between the optical (length or intensity) and 
the electrophoresis based maps. The third type of 
restriction maps ("Com", Fig. 7) results from combining 
length and intensity derived data: data from small 
5 restriction fragments (<60kb) were sized by length, while 
intensity measurements provide the balance of fragment sizes 
needed to complete the maps. 

Fig. 14 shows the ordered restriction maps created from 
Rsr II digestion of chromosome III and XI and Asc I digestion 

10 of chromosome XI by optical mapping, while Fig. 15 shows the 
corresponding fluorescence micrographs of typical digests. 
Relative apparent length results ; using the pooled population 
standard deviation of 47kb to calculate confidence intervals. 
Chromosome, enzyme, mean +/- 90% confidence kb (number of 

15 samples). Ch, III Rsr II 264 +/- 27(8), 86 +/- 27(8). Ch. 
XI Asc I 42 +/- 55(2), 195 +/- 55{2), 242 +/- 55(2). Ch. XI 
Rsr II 67 +/- 45(3), 127 +/- 45(3), 221 +/- 45(3), 260 +/- 
45(3). Relative fluorescence intensity results, using the 
pooled population standard deviation of 36kb to calculate 

20 confidence intervals. Ch. Ill Rsr II 256 +/- 21(8). Ch. XI 
Asc I 80 +/- 42(2), 177 +/- 42(2), 181 +/- 42(2), 237 +/- 
42(2). Ch. XI Rsr II 84 +/- 34(3), 125 +/- 34(3), 226 +/- 
34(3), 240 +/- 34(3). There are no published maps available 
for independent verification of these results. These maps 

25 are constructed by first determining the maximum number of 
cleavage sites from cutting frequency data (similar to Fig. 
7) . Fragments from fully cut molecules are then sized by 
length and intensity and sorted into bins for averaging. 
Relative fluorescence intensity measurements are used to sort 

30 length measured fragments. Obviously, adjacent fragments 
must go into adjacent bins for averaging. Distinctive 
patterns in a digest, such as a very large fragment lying 
next to a very small one, facilitate accurate sorting. Data 
from partial digests was also used to confirm the maps. Data 

35 from partial digests was used to confirm the map constructed 
from fully cut molecules by calculating the expected partial 
fragment lengths and comparing these to the observed data. 
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A new set of analytical approaches to physical mapping 
of very long molecules, such as DNA molecules, is thus 
provided according to the present invention, that is simple 
and intrinsically very rapid. A nearly real time mapping 
5 procedure for chromosomes of yeast has been implemented, but 
this is far from the ultimate capability of the methodology. 
Since most traditional tools of genomic analysis are 
bypassed, including cloning, electrophoresis. Southern 
analysis and PGR, additional speed increases in optical 

10 mapping are not predicated on advances in robotics or 
automation (Chumakov, Nature 359:380, 1992). Simple 
engineering advances in chamber design, sample handling, 
image analysis and informatics should make available a high 
throughput methodology capable of rapidly mapping entire 

15 genomes and, more importantly, extending knowledge of 

sequence information to populations of individuals rather 
than prototypes of each organism (Cavalli-Sf orza, Am. J. Him. 
Genet 46 :649, 1990) . 



20 EXAMPLE 14: 

Optical Happing of LaxDbda Bacteriophage 
Clones Using Restriction Endonucleases 

In the Example presented herein, the size resolution of 

the optical mapping technique is greatly improved upon by the 

imaging individual DNA molecules elongated and fixed onto 

25 

derivatized glass surfaces. Averaged fluorescence intensity 
and apparent length measurements accurately determined the 
mass of restriction fragments 800 base pairs long. 
Specifically, such a solid surface bsed optical mapping 
technique has been used to create ordered restriction maps 
for lambda clones derived from the mouse Pygmy locus. 

14.1 MATERIALS AND METHODS 
Preparation of polvlvsine coated glass surfaces. Cover 
glasses (18^ mm. Fisher Scientific) were cleaned by boiling in 

35 

5 M hydrochloric acid for 2-3 hours, rinsed thoroughly with 
high purity water, air dried and then incubated overnight in 
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filtered, poly-D-lysine (?W=350,500, Sigma) solutions 
(ranging from 1 x 10"^ to 1 x 10'^ g/ml water) . Autoclaved 
water was used for all solutions. 

Microscopy and image collection. DNA molecules were imaged by 
5 a Zeiss Axiovert 3 5 microscope equipped for epif luorescence 
and a lOOX Plan-Neof luar objective (Zeiss) . A Hammatsu C2400 
SIT camera was used to focus a cooled digital CCD camera 
(1032 X 1316 pixels) controlled by standard, commercially 
available software running on a Quadra 900 computer. 

10 DNA preparation and gel electrophoresis. Analyzed clones come 
from a lambda FIX II library constructed from a YAC, mapped 
to the mouse Pygmy locus. Cells were grown and infected with 
plate grown phage using standard protocols and DNA was 
prepared using a commercially available kit (Qiagen, Germany) 

15 with small modifications. Restriction digests were performed 
as per manufacturers directions and analyzed using 
conventional and pulsed field gel electrophoresis. Gels were 
stained with ethidium bromide and documented with Polaroid 
film and a UV transilluminator . 

20 DNA mounting and restriction digestion. 1 /xl of diluted 
clone DNA (5 ng/^1) was added to Ix restriction buffer (as 
suggested by manufacturer but without magnesium ions) , 3% 
/3-mercaptoethanol and 0.2 ng/^1 ethidium homodimer. 3 to 4 
Ml aliquots were pipetted and spread onto slides with drilled 

25 3 mm holes. Polylysine coated cover glasses were dried by 
gently wiping with lens tissue paper (Ross Tissue, Rosmarin 
Corp.) and placed on top of slides and sealed with a mixture 
of Vaseline and mineral oil. Cover glass-slide sandwiches 
were mounted onto the microscope stage and 5 ^1 amounts of 

30 restriction endonuclease (5 to 10 units) diluted in Ix 
restriction buffer (as suggested by manufacturers) were 
diffused into samples through the drilled holes and then 
incubated for 15 minutes at room temperature. 
Characterizations of polvlvsine coated glass surfaces . 1 /xl 

35 of lambda DNA (New England Biolabs) (5 ng/^1) was mixed with 
100 ^1 of Ix EcoRI restriction buffer (50 mM NaCl , 100 mM 
Tris-HCl and 0.025% Triton X-100, pH 7.5; without magnesium 
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ions), ethidium homodimer (0.2 ng//il) and 5% 
/3-mercaptoethanol . 4 /xl samples were pipetted onto cleaned 
microscope slides (no hole) and covered with cover glasses, 
incubated in different poly-D-lysine solution (MW=350,500) 
5 concentration for 16 hours. 20 to 30 different cover glass 
locations were imaged for each concentration. The length and 
number of DNA molecules from different locations were 
averaged. The number of molecules available on per image 
view were calculated from the DNA concentration, sample 
10 volume and the image area. The ratios of the average number 
of molecules to the available molecules, present in solution, 
were calculated and plotted against the polylysine 
concentration . 

Map construction. Maps were constructed from optical data 

15 using techniques described in Example 13, above, with some 
modifications. Briefly, the image processing steps were flat 
field correction, background correction, segmentation, pixel 
value integration, and intensity ratio calculation. The 
relative intensities of the fragments were calculated and the 

20 size in kb was found by multiplying by the known total size. 
The empirical calibration function was applied to eliminate a 
systematic underestimate of small fragments sizes. Fragments 
less than 6 . 5 kb were divided by 0.665. Larger fragments are 
adjusted to preserve the known total size. 

25 Relative apparent lengths were calculated by magnifying 

the image fourfold by pixel replication and using a mouse to 
place a segmented line along each fragment. Fragment ends 
are placed at the center of the gap between fragments. The 
length of each fragment was the sum of the lengths of the 

30 straight line segments. The size in kb was found by dividing 
by the sum of all fragments and multiplying by the known 
total size. 

The image analysis process was repeated for a number of 
molecules from several images taken from one sample. 
35 Molecules which showed the proper number of cuts were 

analyzed. The orientation of each molecule was determined, 
from the sizes of the cloning arm fragments. This permited 
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averaging of many measurements with little chance of 
including data from one fragment in the average of a 
different fragment . 

5 14.2 RESULTS 

Fixing DNA molecules onto polylvsine coated glass 
surfaces 

Polylysine has long been used to fix cells to glass 
surfaces (Williams, Proc. Natl. Acad. Sci. USA 74:2311-2315, 

10 1977) . Extensive measurements of polylysine coated mica 

surfaces by refractive index measurements (Luckham and Klein, 
Chem. Soc, Faraday Trans. I, 80:865-878, 1984) showed that 
polylysine coils can be compressed onto the surface and thus 
alter its properties. Given the extensive history of 

15 polylysine use in cell biology, it was reasoned that 

polylysine coated glass would be simple to control and be 
biochemically compatible. The molecular weight and 
concentration of polylysine used for surface derivitization 
is critical: too much and the molecules are severely fixed 

20 and biochemically inert; too little, and the elongated DNA 
molecules relax quickly to a random coil conformation. These 
are precisely the concerns successfully dealt with in the 
previous agarose -based optical mapping methodology. 

The polylysine concentration was optimized by plotting 

25 the average molecular extension and count found on the 
surface versus polylysine concentration. Fluorescence 
microscopy was used to image labeled molecules on the 
surface. Fig. 16 shows the results of varying polylysine 
concentration {MW=3 50,500) on the counts of lambda 

30 bacteriophage DNA molecules found on the surface, and the 

average molecular length. As expected, the average molecular 
length was small at low polylysine concentration as were the 
counts of molecules detected on the surface. The average 
molecular extension increased with polylysine concentration 

35 and peaked at 10'^ g/ml; further increase of polylysine 

concentration reduced the molecular extension. Predictably, 
the molecule count on the surface increased. 
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The exact mechanism of how extended DNA molecules 
interact with a polylysine coated surface is unknown. Since 
both DNA and polylysine are highly charged polymers, it is 
postulated that electrostatic interactions predominate. It 
5 is further speculated that the average molecular extension 
varies with polylysine concentration because molecular 
extension forces (due to the mounting procedure) balance 
against electrostatic forces, which are generated at the 
surface. The molecule may be thought to flow laterally onto 

10 the surface and the attachment of its individual binding 
sites in not necessarily a synchronous process. At low 
polylysine concentration, the density of polylysine on the 
surface is minimal so there may not be enough binding sites 
to hold an extended molecule with stability. Thus any lambda 

15 DNA molecule bound to the surface will appear as a random 
coil. At high polylysine concentration, abundant . binding 
sites overwhelm any flow forces and the molecule immediately 
forms electrostatic bonds on a small area, quenching 
molecular translation and further extension. Efficient 

20 binding of molecules is expected. At moderate concentration, 
flow and electrostatic forces are probably balanced, to some 
extent, so that maximum extension can occur. 

For optical mapping the conditions chosen were reflected 
in polylysine concentrations between 10"^ and 10*'' g/ml . 

25 producing molecules extended from ICQ to 140% (see Fig. 16) 
of the polymer contour length. It is speculated that polymer 
contour length over-extension is due to helix unwinding by 
the ethidium homodimer (Guo et al . , Nature 359:783-784, 1992; 
Guo et al., J. Biomol . Structure & Dynamics 11:1-10, 1993) 

30 and fluid flow forces. 

Imaging restriction endonuclease digestion . 
Molecules were first fixed onto the polylysine coated 
surface by sandwiching a sample between a treated coverslip 
and a slide. The DNA sample consisted of DNA, restriction 

35 buffer minus magnesium ions, /S-mercaptoethanol and a 

f luorochrome . It was found that ethidium homodimer (Glazer 
et al., Proc. Natk. Acad. Sci. USA. 87:3851-3855, 1990) was 
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compatible with most restriction endonucleases . Coverslips 
were sealed and a small hole in the slide was used as an 
inlet for restriction enzyme and magnesium ions. 

Restriction digests were originally imaged using a SIT 
5 camera and a 512 x 512 pixel digitizing system housed in a 
Macintosh computer3 . A cooled CCD was later obtained having 
higher spatial resolution {1032 x 1316 pixel), which produced 
images with less noise, less spatial distortion, and better 
linearity. It is the preferred instrument for imaging small 

10 DNA molecules (below 20 kb) . Starting with high contrast, 
noise- free images simplifies image processing procedures and 
streamlines data extraction techniques. 

Previous optical mapping protocol required time lapse 
imaging of the restriction endonuclease activity. Using 

15 surface fixed molecules, final results are simply imaged. 

Since molecules were imaged only once, long exposure times of 
20-60 seconds and an elevated illumination level were used. 
Optimum exposure times vary with magnification and the 
desired number of gray levels. Fig. 17 shows typical images 

20 of lambda clone DNA molecules. 800 bp DNA fragments were 
easily imaged (Fig. 17w) . Generally 20-80 x 62 micron 
microscope fields were imaged, containing approximately 100 
suitable molecules . 

The fixation conditions chosen optimized molecular 

25 extension and provided a reasonable number of surface-bound 
molecules. Fixation conditions, however, are not perfect so 
that not all molecules were optimally extended, as indicated 
by the data shown in Fig. 16, and some molecules intersected. 
Imperfectly fixed molecules were not selected for map-making. 

30 Fig. 17 shows typical molecules selected for map-making. 
Mass determination bv fluorescence intensity and 
apparent length measurements . 

The size resolution of fluorescence microscopy is 
approximately 0.1 microns which translates into approximately 

35 300 bp of B-DNA. Theoretically, smaller molecules can be 
detected, but with no spatial resolution. The usable size 
range of the system described here extends from 28 kb to 800 
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bp (see Figs. 17 and 18) and is based on measuring relative 
apparent lengths and relative fluorescence intensities of 
restriction endonuclease fragments from the same parental 
molecule. This is similar to the technique used in Example 
5 13, above, to construct restriction maps of Saccharomyces 
cerevisiae . 

Use of surface mounted rather than gel mountedl DNA 
molecules has reduced the sizing limit from 60 kb to 800 bp. 
Another notable difference is the greatly improved pooled 
10 SD: 3.1 kb vs. 36 kb for intensity and 1.9 kb vs. 47 kb for 
length. The pooled SD for fragments under 7 kb was 1.3 kb by 
intensity and 0.74 kb by length. Excluding samples with many 
adjacent short fragments, the surface fluorescence intensity 
and length data is very reproducible down to 800 bp, whereas 
15 previous results gave poor results below 6 0 kb. The overall 
relative error (which was the same for length and intensity) 
of 5% for large fragments is comparable to errors in sizing 
by agarose gel electrophoresis. It rises to 10% when small 
fragments (5 kb to 800 bp) are included. Note that 10% of 
20 800 bp is 80 bp which contains about 30 f luorochromes . The 
gel sizing error was 5 to 8%. 

Restriction fragments from 800 bp to 5.1 kb were 
consistently under-sized by fluorescence intensity 
measurements, and consequently, neighboring long fragments 
25 were overestimated. However, the pooled standard deviation 
for small fragments was only 1.3 kb. This suggests that the 
measurements are precise, that the deviation is caused by 
some unknown systematic effect, and that it should be subject 
to calibration to correct for a systematic error. Fig. 20B 
30 shows a separate plot of fluorescence intensity determined 
masses versus gel electrophoresis data. The best fit line 
through the origin was used as a calibration curve to correct 
small fragments. Large fragments were adjusted to maintain 
the total size. Fig. 20C shows the results after correction. 
35 Fig. 20D shows the relative apparent length results. 

Because the digest is imaged after the fact, images of 
many molecules can be collected from a single sample in a 
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short time. This makes averaging results to reduce noise 
very feasible. Obviously, averaging cannot improve the 
situation if the initial measurements are so noisy that 
different fragments cannot be distinguished. For lambda 
5 clones, the -11 kb size difference between the cloning arms 
(20 and 9 kb) makes distinguishing one end of the molecule 
from the other trivial even when the noise approaches two 
standard deviations. 

10 Optical Maps of Lambda Clones . 

Fig, 19 shows the EcoRI and BamH I maps constructed by 
Optical Mapping, of Lambda FIX II clones derived from the 
mouse Pygmy locusl9. Table 1 shows the fragment sizes. Fig. 
20 shows typical cleavage patterns by enzymes which cut at 

15 the polylinker site and therefore permit absolute size 
calculations based on the known size of the vector arms 
rather than on PFGE measurements of uncut clones. Table 2 
shows results from PFGE, fluorescence intensity, and apparent 
length measurements of digests with enzymes (Sal I, NotI or 

20 SstI) which cut at the polylinker site. Optical mapping with 
these enzymes permits calculation of the total size of the 
clone. This value can then be used to calculate sizes for 
Optical Mapping with enzymes that do not cut the polylinker. 
Ordered restriction endonuclease maps were constructed 

25 using procedures developed in Example 13, above. Briefly, 
the correct number of fragments by constructing a histogram 
for each clone consisting of the number of imaged restriction 
fragments per parental molecule, and its frequency. 
Generally 100 molecules of each clone were analyzed, and 5-10 

30 molecules were selected for map construction based fragment 
number and map content. Usually these molecules originated 
from histogram bins containing the maximum number of 
restriction fragments. Studying molecule images after 
digestion provided fragment order, and relative fragment 

35 masses were assigned by relative fluorescence intensity and 
relative apparent length measurements. Fragment lengths were 
measured starting at the midpoint of the gap between 
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fragments. The final map is reported as an average of 
restriction fragment sizes derived from similar molecules. 
Molecules were considered similar if the fragment number 
agreed and homologous fragment sizes were within the stated 
5 measurement precision. 

The histogram analysis of the numbers of cut sites for 
each molecule was necessary because small numbers of 
molecules were analyzed and digestion efficiencies were not 
entirely quantitative. Typically it was found that 5-30% of 

10 imaged molecules were fully digested. The efficiency varied 
with fragment number, size and pattern. Contiguous 
restriction fragments below 1 . 5 kb were sometimes 
indistinguishable. Fragments less than 1 kb sometimes broke 
free from the surface and were not observed. It is expected 

15 that these problems would be obviated by imaging sufficient 
numbers of molecules. Additionally, data from partially 
digested clones were used to confirm maps created from fully 
digested molecules. 

Data from partially cut molecules or from fully cut 

20 molecules with defective images was sometimes useful. When 
some but not all fragments could be measured, or when a 
fragment could be unambiguously interpreted as a particular 
partial digestion product, the ratios of the known fragments 
to all combinations of sums of fragments were calculated and 

25 averaged for all available data. These ratios were also 
calculated from fully cut perfectly imaged molecules. Some 
fully cut molecules could not be used directly for intensity 
calculations because one of the vector arm fragments was 
contaminated with intensity that clearly did not belong to 

30 the fragment or because the fragment extended over the edge 
of the image. Similarly, some fragments could not be used 
for length calculations. In those cases, a full set of 
fragment sizes was calculated for the molecule by using 
ratios of unknown fragments to known fragments. 

35 The maps were first constructed by optical mapping and 

then confirmed by gel electrophoresis data generated in this 
laboratory and compared to the previously constructed contig 
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maps. Optical lapping requires an internal size standard: 
the uncut clone or clearly identifiable fragments such as the 
vector arms. For enzymes which do not cut the polylinker, 
gel data was used to size the uncut clone. These sizes were 
5 also obtained by Optical Mapping using enzymes (Not I, Sal I, 
and Sst I) which cut the polylinker (Table 2, Fig 20) . 
Overall, the agreement between electrophoresis based maps and 
optical maps was excellent in terms of fragment size and 
order. Frequently it was found that the optical maps more 
10 accurately reported fragment sizes than agarose gel 

electrophoresis based measurements, particularly when data 
from 10 molecules were averaged. Given the level of sizing 
precision, fragments below 800 bp were not reliably detected. 
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Clone 



Table 1 . Ordered restnction maps lor 28 lambda clones 
EcoR 1 

restriction fragment lengths 



1004 


9.5 


10.2 


4.3 


22.0 






11.1 


4.7 


4.2 


26.6 




202 


9.5 


4.5 


2.0 


4.0 


21.5 


oUb 


1 1.9 


7.3 


2.9 


23.5 




A 

A 


12.8 


1 1.4 


20.8 






B 


17.7 


2.3 


3.3 


23.6 






12.2 


2.8 


4.2 


22.8 




n 
U 


1 1 .4 


8.3 


3.7 


1.9 


24.4 


c 
c 


10.5 


9.5 


1.8 


2.5 


2.5 


r 

r 


1 0.2 


0.7 


2.2 


1.0 


2.9 


U 


1 1 .0 


1.9 


4.2 


3.2 


2.5 


H 


1 1.5 


1.8 


4.1 


3.8 


1.8 


103 


10.0 


8.2 


23.8 






208 


10.5 


1.6 


4.2 


2.3 


21.0 


617 


15.3 


2.5 


1.0 


27.6 




618 


15.7 


2.5 


27.6 






(704 


10.5 


2.0 


4 4 


0 7 




I914 


16.1 


2.2 


27.8 






Yn 


11.6 


4.2 


2.5 


c 




. Y41 


124 


5,6 


4 1 


26 


24 c 


■ A1 


15.1 


1.7 


1.2 


2 8 


25 ^ 


■A2 


13.7 


2,6 


1 6 


1 3 


24 7 


;ei 












.63 


12.0 


30 0 








B4 


9.5 


8.0 


1.6 


1.5 


2 1 


B6 


11.2 


1.6 


9.7 


1.8 


3.0 


B7 


12.7 


4.4 


1.6 


1.5 


1.0 


C3 


11.6 


2.6 


3-8 


2.3 


22,6 



22.1 
21.7 
21.2 
22.7 



22 1 
22.5 
21.9 



BamH I 
restriction fragment lengths 



10.7 
16.6 



10.9 

23.5 

18.8 
13.5 
22.4 
14.2 



1G 3 
105 
14.5 



6.2 
30.4 



7.4 
23.5 

3,0 
9.0 
24.3 
2.0 



9 2 
11 3 

22.5 



1.7 27.4 



6.1 20.6i 



27.9 

26.4 I 

i 

3.6 24,o' 



20 5 
22 5 



30 



35 
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Table 2- Sizes of insert DNA of lambda clones by PFGE and Optical Mapping 



Clone (Enzyme*) 

1004 (N) 

602 (N) 

202 (N) 

305 (S) 

A(S) 

B(N) 
|C(S) 
|0(S) 

!b(S) 

|F(S) 
|G(S) 
;h(S) 

103 (S) 

208 (S) 

617 (S) 

618 (S) 
704 (S) 
9'.4 (S) 
Yn (5/ 

Y41 (S) 
Al (T; 

A2fT) 
Bl (T) 
B3 {T, 
84 (N) 

86 (N) 

87 (N) 
C3 (N) 



PFGE (kb)t 

17.0 ±0.9 
17,6 ±0.9 

12.5 ±0.6 

16.6 ±0.8 
16.0 ±0.8 
17.9 ±0.9 
13.0 ±0.7 

20.7 ± 1.0 
19.9± 1.0 

17.7 ±0.9 
15.0 ±0.8 
16.7±0.8 

13.0 ±0.7 
10.6 ±0.5 
17.4 ±0.9 

16.8 ±0.8 
12.8 ±0.6 

17.1 ± 0.9 
15.3 ±0.8 
20.3 ± 1.0 
17.0 ± 0.9 
15.3±0.8 
8.0 ± 0 4 

13.0 ±0.7 
15.8 ±0.8 

20.8 ± 1.0 

14.1 ±0.7 

13.9 ±0.7 



Optical Mapping (kb)§ 


Intensitv 


Lenyin 


16.2 ±0.9 


1£ A 4. 1 rt 
1 0.U Z 1 .u 


16.5± 1.0 




13.2 ± 0.8 


1 T n r\ c 


17.4 ± 1.0 


1 7 O a. 1 1 

1 / Z I.I 


15.2 ± 0.9 


1 ^ n -4. 1 n 


17.5±0.9 


1 7 ft + 1 n 


12.5 ± 0.8 




20.0 ±1.1 




20.6 ± 1 .2 




1 7.0 ± 0.9 


lfi ft + n Q 


17.0 ±0,9 


172 ±0.9 


17.6 ±0.8 


17.7 ±0,8 


14.0±0.8 


13.6 ±0.9 


9.7 ± 0.7 


9.4 ± 0.8 


18.2 ±0-9 


18.6 ± 1.0 


16.0 ± 0.8 


15.5 ±0.6 


14.0±0.8 


13-6 ±0.7 


18.0z 0 7 


18.4 ± 0.9 


16.2 1 0.6 


16.5 10.8 


19.1 ±09 


19.5± 1.0 


16 4 r 0.9 


15.2 1 0.9 


16.0^0.9 


16 4 1 0.9 


8.6 ±05 


93 ±05 


13.8 10.9 


13.7 1 0.9 


15.0 ±0.9 


15.2 1 0.9 


20 1 ± 1.3 


20.0 ± 1.5 


14.7 ± 0.8 


15.0 + 0.8 


13.1 ±0.7 


13.3 ±0.7 



30 



j • Enzymes N: Not I, S: Sal I, T: Sst I. 
j t PFGE size ± assumed 5% sizing eaor 

L§_Fluorescence intensftv and apparent length t 90% confidence interva l on mean 
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EXAMPLE 15: 

Ordered Restriction Endonuclease Maps of Yeast Artifical 
Chromosome Created by Optical Mapping on Surfaces 

In this Example, a new surface mounting technology for 

5 the rapid construction of ordered restriction maps from 

individual DNA molecules is described. Specifically, such 

technology involves the utilization of polylysine- coated 

derivatized glass surfaces The successful use of this 

technology is demonstrated by the accurate optical 

10 restriction maps constructed from yeast artificial chromosome 

DNA molecules mounted on the derivatized glass surfaces. 

15.1. MATERIALS AND METHODS 

YACs. DNA prenaration and PFGE reafriction map ping Gel 

15 inserts were prepared from five YAC clones (Murray and 
Szostak, Nature 305:189-93, 1983; Burke et al . , Science 
236:806-812, 1987) named 7H6, 314, 3H5, 5L5 and 6H3 and yeast 
strain AB972 following the standard protocol (Schwartz and 
Cantor, Cell 37: 67-75, 1984; Ausubel et al . , eds., in 

20 Current Protocols in Molecular Biology, Vol. 1, 6.10.1-5, 
John Wiley & Sons, New York, NY, 1994). Pulsed field gel 
electrophoresis (PFGE) was performed on an ED apparatus 
(Schwartz et al . , Nature 342:575-576, 1989). YAC sizes were 
measured by comparing relative electrophoretic mobilities to 

25 lambda DNA concatamers and yeast chromosomes. PFGE maps of 
7H6 and 314 were constructed by Southern blotting YAC DNAs 
cut with different restriction enzymes. Blots were 
hybridized (Church and Gilbert, Proc. Natl. Acad. Sci. USA 
81:1991, 1984) with radiolabelled human Alu repeat probe. 

30 Ordered maps of 3H5, 5L5 and 6H3 were constructed by partial 
digestion (Smith and Birnstiel, Nucleic Acids Res. 
3:2387-2399, 1976) using probes derived from the right and 
left cloning arms. 

Surface preoararion. DNA m ounting, and di a<=>.eit^ i 

35 Glass coverslips were cleaned in excess 3 M HCl at 95°C 

for 2 hours and then thoroughly washed with high purity 
water. Cleaned glass coverslips were derivatized by 
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immersion for varying lengths of time in freshly prepared 
0.10 M 3-aminopropyltriethoxysilane (APTES; Sigma), pH 3.5, 
at 65°C. After APTES treatment, coverslips were washed 
thoroughly with high purity water and air dried. In order to 
5 create a chamber for DNA mounting glass microscope slides 
were drilled to create a 1 cm diameter hole which was then 
sandwiched between two coverslips. First the APTES treated 
coverslip was attached with silicone vacuum grease. Then 20 
Ml of DNA in molten agarose gel were slowly spread onto the 

10 APTES derivatized surface with a pipetman. The top of the 
chamber was then quickly sealed with an untreated coverslip 
using vacuum grease. Chambers were incubated on a 45°C 
heating block for 10-30 minutes to allow DNA in the molten 
agarose to transfer to the derivatized glass surface. 

15 Slightly tilting the chambers generated a mild fluid flow and 
helped to stretch out the DNA during transfer. After 
transfer, chambers were chilled at for 5 minutes to set 

the gel. Then the chambers were opened and 3-5 units of 
restriction endonuclease , diluted in appropriate buffer, was 

20 added to the gel surface. Chambers were resealed and 

incubated 1-2 hours at 37oc. After digestion, samples were 
stained either with ethidium homodimer {Molecular Probes, 0.1 
ng/ml ethidium homodimer, 15 mM EDTA (pH 7.5) and 10% 
2-mercaptoethanol) or oxazole yellow homodimer (YOYO-1) 

25 (Molecular Probes, O.lng/ml YOYO-1, 15 mM EDTA, pH 7.5, and 
20% 2-mercaptoethanol) . Dilution of high molecular weight 
DNA. It is important to control DNA concentration when 
mounting DNA molecules for optical mapping. DNA molecules 
excised as bands from low melting temperature PFGE gels 

30 (Seaplaque, FMC) often must be diluted before mounting, as 
follows: Incubate gel band for 2 hours in 0.01 mM spermine 
tetrachloride (Sigma) in TE buffer (10 mM Tris, pH 7 . 6 ; 1 mM 
EDTA) . This step condenses the gel -embedded DNA molecules 
into shear resistant particles, protecting them during 

35 dilution. Next melt gel bands at 12^C for 7 minutes and mix 
with additional molten low melt agarose containing 0.01 mM 
spermine. Vortexing at this step causes little apparent 
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breakage. Diluted samples are made into gel inserts 
(Schwartz and Cantor, Cell 1984) which are then 

washed 5 times, 3 0 minutes each with shaking, with TE buffer 
to remove the spermine and thereby decondense the DNA 
5 particles. The first wash is in TE supplemented with 100 mM 
NaCl. Gel inserts were stored in 10 mM Tris, 0 . 5 mM EDTA, pH 
7.6. 

Microscopy. image analysis and map construction. DNA 
molecules were imaged using a Zeiss Axioplan or Axiovert 135 

10 microscope equipped for epi- fluorescence (filter pack for 
green excitation and red emission) and a lOOX Plan-Neof luar 
objective (Zeiss) coupled to a Hamamatsu C2400 SIT camera 
(Example 13). A typical 100 micron microscopic field 
contained three to five molecules suitable for analysis. 

15 Efficiency of restriction endonuclease digestion was scored 
by counting gaps in molecules with known restriction maps. 
Digestion efficiencies did not differ among the enzymes used 
in this study. Restriction maps were constructed as 
described in Example 13 . 

20 

15.2 RESULTS 

Optimizing mounting conditions for large DNA molecules 
on derivatized glass surfaces. Large DNA molecules are 
easily broken during transfer (Albertsen et al . , Proc . Natl. 

25 Acad. Sci. USA 87:4256-60, 1990) and maintaining their 

integrity during surface mounting operations required special 
effort. Molten agarose has been used to mount, with high 
efficiency, DNA molecules greater than 1 megabase in size 
(Example 13), but it is sometimes difficult to bring an 

30 entire molecule into sharp focus and the agarose gel scatters 
light. To eliminate these problems with agarose fixation, 
the fluid turbulence damping properties of molten agarose 
were combined with the stability of surface mounting by 
fixing large DNA molecules dissolved in molten agarose onto 

35 APTES derivatized glass surfaces (Lyubchenko et al . , J. of 
Biomolecular Struct, and Dynamics 10:589-606, 1992; Weetal, 
Methods Enzymol . 44:19, 1976). It was reasoned that this 
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combined technique would enable high contrast imaging, since 
it would minimize the amount of agarose gel between DNA 
molecules and the microscope objective. This approach was 
evaluated by testing whether DNA in an agarose matrix could 
5 interact with an APTES modified glass surface to produce 
optimally elongated and stabilized molecules in an 
environment conducive to restriction endonuclease activity. 

Surface derivitization conditions affect two important 
aspects of DNA fixation: molecular adhesion and elongation. 

10 Ideally molecules should be tightly attached and well 

stretched out. In fact these two conditions are antagonistic 

too much adhesion will prevent elongation, whereas too 
little may allow optimal elongation but will not fix 
sufficient numbers of molecules to the surface. To achieve a 

15 suitable balance, the amount of APTES was titrated on the 
surface against the measured average molecular length of 
deposited molecules. Fluorescence microscopy was used to 
image stained molecules on APTES modified glass coverslips. 
The incubation time of cleaned glass coverslips in a 0.10 M 

20 APTES solution was varied from 0 . 5 to 5 hours, deposited 

undiluted Saccharomyces cerevisiae {AB972) chromosome I (240 
kb> in molten agarose, and measured molecular lengths from 
fluorescence micrographs. The number of molecules attached 
to the surface was also counted. The goal was to maximize 

25 molecular extension while maintaining a usable number of 
molecules on the surface. Fig. 21 shows a plot of APTES 

concentration versus average molecular extension and number 
of molecules per 100 m^ field. At low APTES concentration, 
the average molecular extension as well as the number of 

30 molecules detected on the surface was minimal. The average 
molecular extension increased with APTES concentration and 
peaked at 3 hours; further increase in APTES concentration 
reduced molecular extension and, predictably, increased the 
number of molecules on the surface. It is not known exactly 

35 how large DNA molecules interact with an APTES modified 
surface. One may speculate that attractive electrostatic 
forces between DNA and the charged surface are balanced, to 
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some extent, by the molecular flow forces generated during 
the mounting procedure. The surface charge density increases 
as more APTES is deposited, while flow forces remain 
constant. Thus, minimum molecular extension should be 
5 measured at high and low APTES surface densities. Based on 
the data shown in Fig. 21, it was initially decided to use 
glass surfaces incubated in APTES for 3 hours. This 
incubation time was found to produce a uniform extended 
length distribution; however, the molecules relaxed 

10 excessively during a 2 hour digestion. The APTES incubation 
time was then extended to 5 hours. At 5 hours, the mean 
length is roughly 55% that of the polymer contour length. A 
high degree of elongation facilitates the detection of small 
restriction fragments, but may inhibit restriction 

15 endonuclease activity. The next step was to assay 

restriction endonuclease activity {Example 13) . Digestion 
of Mounted DNA Molecules. In previous optical mapping 
studies DNA molecules were typically elongated to roughly 30% 
of their polymer contour length. This degree of elongation 

20 was chosen to optimize image contrast: more condensed 
molecules have a higher fluorochrome density. Recently, 
longer image integration times were used to collect adequate 
information from lower density images. In this Example, 
surface mounted molecules were typically extended to 50-60% 

25 of their polymer contour length. It was found that such 
molecules were more effectively cleaved by restriction 
endonucleases than more condensed molecules mounted in 
agarose: 85% versus 50%. Efficiency was measured as the 
probability of cleavage at a given cognate site (Example 13) . 

30 The overall image quality was greatly improved as well. 

Mounting DNA molecules on a surface has a drawback -- 
not only does most of the DNA in the molten agarose stick to 
the surface, fluorescent debris sticks as well. Thus, the 
DNA concentration had to be lowered since observation was 

35 limited to a single optical plane. A shear-free dilution 

protocol was developed based on spermine condensation (Gosule 
and Schellman, J. Mol . Biol. 121:311-326, 1978). The 
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protocol successfully collapses DNA coils embedded within 
agarose so that molten agarose can even be vortexed without 
significant DNA breakage. The spermine DNA 
condensation/sample dilution step was used for all YAC 
5 samples. After dilution, spermine was removed by washing gel 
inserts in excess TE buffer. 
Mass Determination. 

A quantitative relationship between mass and the 
measured fluorescence intensity of a labeled DNA molecule, as 

10 imaged by fluorescence microscopy was previously demonstrated 
(Example 13) . Additionally, a reliable relationship between 
microscopically imaged restriction fragment length and mass 
was established (Example 13) . These studies were performed 
using DNA molecules fixed in agarose gel. Since the surface 

15 mounting conditions described in this example are different, 
the methods for mass determination had to be reevaluated. 
Surface mounted §^ cerevisiae chromosomal DNA molecules were 
digested with NotI and restriction fragment fluorescence 
intensity and length was measured. These measurements were 

20 plotted against the well established NotI fragment sizes of 
S^ cerevisiae chromosomes (Example 13; Link and Olson, 
Genetics 127:681, 1991) (see Figs. 22 and 23). Fluorescence 
micrographs of typical molecules are shown in Fig. 23. The 
most notable difference between fluorescence intensities 

25 measured for surface mounted molecules vs. gel mounted 

molecules (Example 13) is improved reproducibility: pooled 
standard deviation (SD) is 17 kb vs. 36 kb previously shown 
(in Example 13) . Also, the fluorescence intensity data on 
surface mounted molecules is accurate down to 3 0 kb, whereas 

30 our previous gel mounting protocol gave poor results below 60 
kb. The overall relative error with surface mounted 
molecules was 4%, identical to results obtained by standard 
methods (Link and Olson, Genetics 127:681, 1991) and the 
average of the coefficients of variation was 12%, indicating 

35 precision comparable to routine PFGE analysis. Mass 
determination by measuring length of surface mounted 
molecules is also superior to previous results with gel 
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mounted molecules. The length measurements showed a pooled 
SD of 32 kb vs. 47 kb and the average of the coefficients of 
variation was 29%. The relative error was 1\. which was not 
as accurate as the fluorescence intensity data. These 
5 fragment sizing studies show that fluorescence intensity is 
more accurately and reliably correlated to mass than length. 
Overall, the images of surface mounted molecules were 
consistently in one focal plane. Good focus is essential for 
accurate fluorescence intensity measurements, whereas length 

10 measurements are less subject to error due to blurry images. 
Apparently, restriction fragments produced by digestion of 
surface mounted molecules vary in length more than 
fluorescence intensity values. Errors caused by length 
variation could be reduced by selecting only uniformly 

15 elongated DNA molecules. 

Improved images with YOYO-1. 

New fluorochromes with improved DNA binding efficiencies 
and quantum yields have been developed recently. Oxazole 
yellow homodimer (YOYO-1) vs. ethidium homodimer were tested 
20 to optically map YAC clones 3H5, 5L5 and 6H3 . The YOYO-1 
images were brighter and of higher contrast than those made 
with ethidium homodimer. Also, while high salt conditions 
diminish the fluorescence emission of ethidium stained 
molecules, YOYO-1 stained molecules retain luminosity in high 
25 salt and under severe fixation conditions. Interestingly, 
serious photodamage to DNA was observed in solution with 
YOYO-1, manifested as double strand breaks, even in the 
presence of 2 -raercaptoethanol . Fortunately, surface mounted 
YOYO-1 stained DNA molecules had no measurable photodamage 
30 (double-strand breaks) in the presence of 20% (v/v) 

2-mercaptoethanol. Additional 2 -mercaptoethanol was found to 
quench YOYO-1 fluorescence. The qualitatively superior image 
contrast attainable with YOYO-1 improved restriction fragment 
sizing results: the pooled standard deviation on the means 
35 calculated for YOYO-1 stained restriction fragments dropped 
to 11 kb from 17 kb and the average coefficient of variation 
decreased to 7% from 12%. 
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Restriction Dige stion and Map Construction . 
Five YACs were optically mapped with restriction 
endonucleases Mlul, EagI, Nrul and NotI using the optimized 
APTES fixation and YOYO-1 staining conditions described 
5 above. In general, images were clear and high contrast. 
Maps were constructed using previously described procedures 
(Example 13) , with minor modifications to exploit the 
potential of high contrast imaging. The analysis necessary 
for map construction was simplified, in comparison to the 

10 previous approach, since molecules were imaged after 

digestion. Long image integration times were used, and only 
one image was collected per microscope field. Previous 
procedures (Example 13) required the examination of a series 
of time lapse images and the analysis of 4-5 contiguous 

15 (temporal) images. The cleavage sites of surface mounted 
molecules were flagged by the appearance of gaps, and 
fragment ends occasionally displayed bright regions of 
condensed DNA. To orient these maps, the YACs were further 
characterized by double digests. Some of the resulting maps 

20 include as many as 6 fragments ranging in size from 40-180 
kb. The overall agreement between optical and PFGE maps was 
excellent, in terms of both fragment sizing and ordering. 

15.3. DISCUSSION 

25 Optical restriction mapping of DNA molecules is a new 

alternative to conventional gel and hybridization based 
methods for producing restriction maps of large DNA 
molecules. Optical mapping is an attractive technology based 
on the following considerations: i) it is rapid and safe, not 

3 0 requiring time consuming procedures such as gel 

electrophoresis, preparation and radiolabelling of probes, 
nucleic acid hybridization and autoradiography. Further, it 
is an easy and inexpensive technique to perform, requiring - 
apart from the microscope and camera - very small quantities 

35 of very simple materials. ii) The technique yields 

consistent results, the accuracy of which has been proven by 
direct comparison with standard methods. iii) The technique, 
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because it analyzes individual DNA molecules, holds enormous 
potential for miniaturization and automation and consequent 
order of magnitude increases in throughput and decreases in 
cost . 

5 This example describes several important improvements to 

optical mapping that derive from the ability to analyze DNA 
molecules adhered to APTES derivatized glass surfaces. 
First, with surface mounting it is easier to find large 
molecules in one focal plane. This simplifies the analysis 

10 necessary for map construction since, in contrast to the 
previous approach, molecules are imaged after digestion. 
Second, the longer imaging times possible with surface 
mounting allow DNA molecules to be extended up to 60% of 
their polymer contour length (vs. 30% previously). The more 

15 extended molecules are more efficiently cleaved by 

restriction endonucleases : 85% of sites are cut {vs. 50% 
previously) . Thus the basic mechanics of the technique are 
more robust. A valuable consequence is that fluorescence 
intensity-based length data are more reproducible and 

20 accurate. A third benefit of surface mounting compared to 
agarose gel fixation is that small DNA fragments are more 
readily detected because surface mounting restrains their 
tendency to relax back into the gel matrix and disappear from 
view. Reliable measurements are now possible for molecules 

25 as small as 30 kb (vs. 60 kb previously) . A fourth 

improvement results from the superior performance of the 
fluorochrome YOYO-1 compared to ethidium homodimer. YOYO-1 
produces clearer images of higher contrast and, unlike 
ethidium homodimer, is unimpeded by high salt. The improved 

30 images contribute to more reliable DNA fragment sizing as 
measured by lower standard deviations on mean restriction 
fragment sizes , 

Presently, a large fraction of the human genome is 
covered by YAC contigs (Cohen and Weissenbach, Nature 

35 366:698-701, 1993). The information content of most contigs 
consists of a list of sequence tagged sites or other markers, 
the YACs associated with each marker and in some cases the 
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sizes of the YACs . In general there is little detailed YAC 
characterization and as a result it is difficult to assess 
the true physical distance spanned by most contigs. Further, 
as physical landmarks become more closely spaced it becomes 
5 more difficult to correctly order them using YAC libraries 
because nearby markers will often be contained in identical 
sets of YACs, or YAC rearrangements may give contradictory 
data . 

Restriction mapping is unique among the techniques 

10 available for YAC characterization in providing a truly 
linear, sequence based representation of DNA content. 
Restriction maps of overlapping YACs are also useful for 
sorting out YAC overlap, DNA rearrangement and chimerism. 
Finally, an ordered restriction map (or maps, using several 

15 enzymes) can be treated as a complex fingerprint and used as 
a tool in map construction, similar to the use of cosmid 
fingerprinting (Stallings et al . , Proc . Natl. Acad, Sci . USA 
87:6218-22, 1990). Such a fingerprint is considerably more 
complex and reproducible than fingerprints generated by 

20 hybridizing digested YAC DNA with repeat sequences. 

It is evident from relatively advanced sequencing 
projects in lower organisms that an ordered restriction map 
is an essential prelude to more detailed studies of DNA 
sequence. Perhaps because of the extensive labor required, 

25 human YAC restriction maps based on PFGE have not been 

produced on a large scale. The dramatic simplification and 
increase in speed offered by optical mapping makes the 
prospect of detailed restriction maps covering large 
continuous segments of a complex genome an attainable goal. 

30 Optical mapping makes it possible to address directly some of 
the artifacts of YAC cloning. Yeast strains with two or more 
co-cloned YACs can be effectively analyzed by optical 
mapping. Yeast strains with unstable YACs in which only a 
fraction of the yeast contain full length molecules can also 

35 be effectively mapped optically. The analysis of genomic 
regions prone to rearrangement will also be facilitated by 
optical mapping because of the ease of analyzing multiple 
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YACs with multiple enzymes. Optical mapping is likely to be 
equally useful in analyzing other large insert clones such as 
PI, PI artificial chromosome (PAC) and bacterial artificial 
chromosome (BAC) clones and ultimately in generating accurate 
5 detailed restriction maps for large portioins of the human 
genome . 

All references cited herein, including journal articles 
or abstracts, published or corresponding U.S. or foreign 

10 patent applications, issued U.S. or foreign patents, or any 
other references, are entirely incorporated by reference 
herein, including all data, tables, figures, and text 
presented in the cited references. 

Reference to known method steps, conventional methods 

15 steps, known methods or conventional methods is not in any 
way an admission that any aspect, description or embodiment 
of the present invention is disclosed, taught or suggested in 
the relevant art. 

The foregoing description of the specific embodiments 

20 will so fully reveal the general nature of the invention that 
others can, by applying knowledge within the skill of the art 
(including the contents of the references cited herein), 
readily modify and/or adapt for various applications such 
specific embodiments, without undue experimentation, without 

25 departing from the general concept of the present invention. 
Therefore, such adaptations and modifications are intended to 
be within the meaning and range of equivalents of the 
disclosed embodiments, based on the teaching and guidance 
presented herein. It is to be understood that the 

30 phraseology or terminology herein is for the purpose of 

description and not of limitation, such that the terminology 
or phraseology of the present specification is to be 
interpreted by the skilled artisan in light of the teachings 
and guidance presented herein, in combination with the 

35 knowledge of one of ordinary skill in the art. 
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WHAT IS CLAIMED IS : 



1. A nucleic acid molecule elongated and fixed 
onto a planar surface so that the nucleic acid molecule 
5 remains accessible for enzymatic reactions and/or 
hybridization reactions. 



2. The elongated 
Claim 1 in which the nucleic 

10 

3 . The elongated 
Claim 1 in which the nucleic 



fixed nucleic acid molecule of 
acid molecule is a DNA molecule 

fixed nucleic acid molecule of 
acid molecule is a RNA molecule 



4. The elongated fixed nucleic acid molecule of 
15 Claim 1 in which the planar surface is derivatized glass. 

5. The elongated fixed nucleic acid molecule of 
Claim 4 in which the glass surface is derivatized by a 
coating of a charged substance that increases the 

20 electrostatic interaction between the nucleic acid molecule 
and the surface, at a charge density sufficient to maintain 
the nucleic acid molecule in an elongated state while 
allowing for a small degree of relaxation. 

25 6. The elongated fixed nucleic acid molecule of 

Claim 5 in which the changed substance is poly-D-lysine or 3 
aminopropyltriethoxysilane . 



7. The elongated fixed nucleic acid molecule of 
30 Claim 1 which is fixed in a gel. 

8. The elongated fixed nucleic acid molecule of 
Claim 3 in which the gel is agarose or polyacrylamide . 

35 9. The elongated fixed nucleic acid molecule of 

Claim 1 in which the planar surface further includes an 
enzyme fixed onto the surface. 
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10. The elongated fixed nucleic acid molecule of 
Claim 9 in which the enzyme is a restriction endonuclease, an 
exonuclease, a polymerase, a ligase or a helicase. 

5 11. The elongated fixed nucleic acid molecule of 

Claim 10 in which the planar surface further includes a 
chelated cofactor required for the activity of the fixed 
enzyme . 

10 12. The elongated fixed nucleic acid molecule of 

Claim 11 in which the chelated cofactor is released upon 
exposure to a specific wavelength of light, and the fixed 
enzyme is activated in the location of the exposure. 

15 13. A method of preparing an elongated nucleic 

acid molecule fixed onto a planar surface, comprising 
depositing the nucleic acid molecule onto a planar glass 
surface coated with a charged substance that increases the 
electrostatic interaction between the nucleic acid molecule 

20 and the surface, the charge density being sufficient to 

maintain the nucleic and molecule in an elongated state while 
allowing for a small degree of relaxation. 

14. The method of Claim 13 in which the charged 
25 substance is poly-D-lysine, or 3 -aminopropyltriethoxysilane . 

15. The method of Claim 13 or 14 in which the 
nucleic acid molecule deposited on the planar surface is in a 
solution containing glycerol. 

30 

16. A method of preparing an elongated nucleic 
acid molecule fixed onto a planar surface comprising applying 
external force to the nucleic acid molecule within a non- 
polymerized gel composition on the planar surface, so that 

35 the nucleic acid molecule is elongated, and upon 

polymerization of the gel, the elongated nucleic acid 
molecule is fixed in place prior to excessive relaxation, and 
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is maintained in an elongated, relatively stationary 
position, yet accessible for enzymatic reaction and/or 
hybridization reactions. 

5 17. The method of Claim 16 in which the gel is 

agarose, or polyacrylamide . 

18. The method of Claim 16 in which the elongation 
is accomplished by physical compression. 

10 

19. The method of Claim 16 in which the elongation 
is accomplished by electrical force. 

20. The method of Claim 13 or 16 in which the 
15 planar surface further includes an enzyme fixed onto the 

surface . 

21. The method of Claim 20 in which the enzyme is 
a restriction endonuclease, an exonuclease, a polymerase, a 

20 ligase or a helicase. 

22. The method of Claim 21 in which the planar 
surface further includes a chelated cofactor required for 
activity of the fixed enzyme. 

25 

23. The method of Claim 22 in which the chelated 
cofactor is released upon exposure to a specific wavelength 
of light, and the fixed enzyme is activated in the location 
of the exposure . 



30 



35 



24 . A method for characterizing a nucleic acid 
molecule, comprising imaging an elongated and fixed nucleic 
acid molecule of Claim 1 to obtain its physical 
characteristics . 

25. The method of Claim 24 in which the nucleic 
acid molecule is a DNA molecule. 
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26. The method of Claim 24 in which the nucleic 
acid molecule is an RNA molecule. 



27. The method of Claim 24 in which the planar 
5 surface is derivatized glass. 



28. The method of Claim 27 in which the glass 
surface is derivatized by a coating of a charged substance 
that increases the electrostatic interaction between the 
10 nucleic acid molecule and the surface, at a charge density 
sufficient to maintain the nucleic acid molecule in an 
elongated state while allowijig for a small degree of 
relaxation. 



15 29. The method of Claim 28 in which the glass is 

derivatized with poly-D-lysine or 3- 
aminopropyltriethoxysilane . 

30. The method of Claim 24 in which the elongated 
20 fixed nucleic acid molecule is fixed in a gel. 

31. The method of Claim 30 in which the gel is 
agarose or polyacrylamide . 

25 32. The method of Claim 24 in which the planar 

surface is a glass slide and the imaging is accomplished 
using an optical microscope. 

33. The method of Claim 32 in which the image is 
30 computer enhanced. 

34. A method for characterizing a nucleic acid 
molecule , comprising : 

(a) reacting an elongated fixed nucleic acid 
35 molecule of Claim 1 with an enzyme that 

modifies nucleic acid molecules; and 
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(b) imaging the elongated fixed nucleic acid 

molecule to detect a change in its physical 
characteristics . 

f 5 35. The method of Claim 34 in which the nucleic 

^ acid molecule is a DNA molecule. 

I 

36. The method of Claim 34 in which the nucleic 
acid molecule is an RNA molecule. 

37. The method of Claim 34 in which the enzyme is 
a restriction endonuclease . 

38. The method of Claim 34 in which the enzyme is 
15 an exonuclease. 

■'^ 39. The method of Claim 34 in which the enzyme is 

a polymerase . 

20 40. The method of Claim 34 in which the enzyme is 

a ligase. 

41. The method of Claim 34 in which the enzyme is 
a helicase. 

25 

42. The method of Claim 34 in which the planar 
surface is a glass slide and the imaging is accomplished 
using an optical microscope. 

30 43. The method of Claim 42 in which the image is 

^ computer enhanced. 

44. A method for characterizing a nucleic acid 
molecule , comprising : 
35 (a) hybridizing an elongated fixed nucleic acid 

molecule of Claim 1 with a single-stranded 
^ nucleotide ; and 
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(b) imaging the elongated fixed nucleic acid 
molecule to detect hybridization reaction 
products . 

5 45. The method of Claim 44 in which the elongated 

fixed nucleic acid molecule is a DNA molecule. 

46. The method of Claim 44 in which the elongated 
fixed nucleic acid molecule is an RNA molecule. 

10 

47. The method of Claim 44 in which the single - 
stranded nucleotide is labeled. 

48. The method of Claim 47 in which the label is a 
15 radiolabel, a fluor, a colorimetric dye or an enzyme. 

49. The method of Claim 44 in which the single- 
stranded nucleotide is an oligonucleotide primer, and a 
polymerase is added to the hybridization reaction. 

20 

50. The method of Claim 44 in which the planar 
surface is a glass slide and the imaging is accomplished 
using an optical microscope. 

25 51. The method of Claim 50 in which the image is 

computer enhanced. 

52. A kit for mapping or sequencing a nucleic acid 
molecule , comprising : 

30 (a) the nucleic acid molecule elongated and fixed 

onto a planar surface so that the nucleic acid 
molecule remains accessible for enzymatic 
reactions and/or hybridization reactions; and 
(b) an enzyme that modifies nucleic acids. 

35 

53. The kit of Claim 52 which further includes 
reagents for the enzymatic reaction. 
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54. The kit of Claim 52 in which the enzyme is a 
restriction endonuclease, an exonuclease, a polymerase, a 
ligase, or a helicase. 

5 55. The kit of Claim 52 in which the elongated 

fixed nucleic acid molecule is a DNA molecule. 

56. The kit of Claim 52 in which the elongated 
fixed nucleic acid molecule is a RNA molecule. 

10 

57. The kit of Claim 52 in which the planar 
surface is derivatized glass. 

58. The kit of Claim 57 in which the glass is 
15 derivatized by a coating of a charged substance that 

increases the electrostatic interaction between the nucleic 
acid molecule and the surface, at a charge density sufficient 
to maintain the nucleic acid molecule in an elongated state 
while allowing for a small degree of relaxation. 

20 

59. The kit of Claim 52 in which the elongated 
fixed nucleic acid molecule is fixed in a gel. 

60. The kit of Claim 59, in which the gel is 
25 agarose or polyacrylamide . 

61. A kit for mapping or sequencing a nucleic acid 
molecule , comprising : 

(a) the nucleic acid molecule elongated and fixed 
30 onto a planar surface so that the nucleic acid 

molecule remains accessible for enzymatic 
reactions and/or hybridization reactions; and 

(b) a nucleotide probe. 

35 62. The kit of Claim 61 which further includes 

reagents for the hybridization reaction. 
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63. The kit of Claim 61 in which the elongated 
fixed nucleic acid molecule is a DNA molecule. 

64. The kit of Claim 61 in which the elongated 
5 fixed nucleic acid molecule is a RNA molecule, 

65. The kit of Claim 61 in which the nucleotide 
probe is labeled. 

10 66. The kit of Claim 65 in which the label is a 

radiolabel, a fluor, a colorimetric dye or an enzyme. 

67. The kit of Claim 60 in which the nucleotide 
probe is an oligonucleotide primer. 

15 

68. The kit of Claim 66 which further includes a 
polymerase . 

69. A glass surface derivatized to fix an 

20 elongated nucleic acid molecule so that the nucleic acid 
molecule remains accessible for enzymatic reactions and/or 
hybridization reactions, in which the glass surface is coated 
with a charged substance that increases electrostatic 
interaction between the nucleic acid molecule and the 

25 surface, at a charge density sufficient to maintain the 
nucleic acid molecule in an elongated state while allowing 
for a small degree of relaxation. 

70. The derivatized glass surface of Claim 69 in 
30 which the charged substance is poly-D- lysine or 

3 - m inop ropy 1 1 r i e t hoxy s i 1 ane . 

71. The derivatized glass surface of Claim 69 
which further includes an enzyme fixed onto the surface. 

35 
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72. The derivatized glass surface of Claim 71 in 
which the enzyme is a restriction endonuclease, an 
exonuclease, a polymerase, a ligase or a helicase. 

5 73. The derivatized glass surface of Claim 71 in 

which the glass surface further includes a chelated cofactor 
required for activity of the fixed enzyme. 

74. The derivatized glass surface of Claim 73 in 
10 which the chelated cofactor is released upon exposure to a 

specific wavelength of light, and the fixed enzyme is 
activated in the location of the exposure, 

75. A kit comprising the derivatized planar glass 
15 surface of Claim 63, and a reagent used for depositing, 

elongating and fixing a nucleic acid molecule onto the glass 
surface . 

76. The kit of Claim 75 in which the reagent is a 
20 glycerol solution. 

77. The kit of Claim 75 further comprising 
reagents used in enzymatic reactions or hybridization 
reactions . 

25 

78. A system for characterizing a nucleic acid 
molecule , comprising : 

(a) the nucleic acid molecule elongated and fixed 
onto a planar surface so that the nucleic acid 

30 molecule remains accessible for enzymatic 

reactions and/or hybridization reactions; and 

(b) a device for imaging the elongated fixed 
nucleic acid molecule to obtain its physical 
characteristics . 

35 

79. The system of Claim 78 in which the planar 
surface further includes an enzyme fixed onto the surface. 
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80. The system of Claim 79 in which the enzyme is 
a restriction endonuclease , an exonuclease, a polymerase, a 
ligase or a helicase. 

5 81. The system of Claim 79 in which the planar 

surface further includes a chelated cofactor required for 
activity of the fixed enzyme, 

82. The system of Claim 81 in which the chelated 
10 cofactor is released upon exposure to a specific wavelength 

of light, and the fixed enzyme is activated in the location 
of the exposure . 

83. The system of Claim 78 in which the planar 
15 surface is a glass slide and the device for imaging is an 

optical microscope. 

84. The system of Claim 83 in which the glass 
surface is derivatized by a coating of a charged substance 

20 that increases the electrostatic interaction between the 
nucleic acid molecule and the surface, at a charge density 
sufficient to maintain the nucleic acid molecule in an 
elongated state while allowing for a small degree of 
relaxation . 

25 

85. The system of Claim 83 in which further 
includes a device for enhancing the image obtained in the 
optical microscope. 

30 86. The system of Claim 83 in which the device for 

enhancing the image obtained is a computer. 

87, The system of Claim 78 in which a plurality of 
elongated nucleic acid molecules are fixed to the planar 
35 surface in an ordered array to form a grid-like pattern. 
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88. The system of Claim 87 in which the device f( 
imaging the elongated fixed nucleic acid molecules includes 
device to adjust the x-y axis to position the fixed nucleic 
acid molecule to be imaged. 

5 

89. The system of Claim 88 in which the imaging 
device further includes an auto- focus. 
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Tliis inicnuiional a-port has not been c^ul)ll.shcd in a-Jijvci u I certain claims under Article l7C)fa) lor ihe foUtiwing ruasons: 

1. Q Claims Nus.: 

because Ihey relate to subject matter not required to be searched by this Aulhoriiy. namely: 



I I Claims Nos.: 

because ihey relate lo parts ot'iUe intcrnationai application that do not comply with the prescribed requirements to such 
an extent thai no meaningfui iniemaiional search can be carried out. specifically: 



3. I I Claims Nos.: 

iKcause tiicy arv dejiendent eiauns and aa- not drailed in accordance with tiic second and third sentences of Rule 6.4(a). 
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BOX II. OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING 
This ISA tbund multiple invedlions as loltows: 

This application conUms the Ibituwmg inveiuiuni or groups of inventions which arc not so linked as to form a single 
inventive concept under PCT Rule 13,1. In order for all inventions to be examined, the appropriate additional 
examination fees must be paid. 

I. Claims 1-33. drawn to elongated nucleic acid molecules and methods of preparation 

li. Claims 34-51 and 78-89, drawn lo a nieliiods and systems fur characterizing nucleic acids 

III. Claims 52-68, drawn to a melhods and kits for scquencmg and mapping 

IV. Claims 69-77, drawn tu a derivaiized glass surface and kit containing said glass surface 

The inventions listed as Groups I and II do not relate to a single inventive concept under PCT Rule 13.1 because, 
under PCT Rule 13.2, Uicy lack the same or corresponding special technical features for the following reasons; Groups 

I and IV do not involve imaging of changes in the physical characteristics of nucleic acids, as does Group II. Groups I. 

II and iV do not encompass meihods of mapping and sequencing, as does Group III. Group III does not encompass 
glass surfaces dcrivatized witJi poly lysine or 3-minopropytlriethoxysilanc. 
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